Gene Arth_1187 details

Gene Information

Locus tag: Arth_1187
Symbol:
ID: 4446318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1287471
End bp: 1289111
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 67%
IMG OID: 639688994
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_830681
Protein GI: 116669748
COG category: [K] Transcription
COG ID: [COG1316] Transcriptional regulator
TIGRFAM ID: [TIGR00350] cell envelope-related function transcriptional attenuator common domain
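
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1287471, 1289111)
print(length)                     # 1641
print(protein_length_aa(length))  # 546
```

Under these assumptions the result agrees with the listed gene length (1641 bp) and protein length (546 aa).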


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.208279
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCA GCCACTCCCG CCCCCAGGCG CCGCAACAGG CCCTGACGGA CCCCGTCCGC 
AACCCGGCGA ACGCCCCGGC GCCCGTCAGG ACCAAACGCG CGTTCGTCCT CCTGTTGCTG
ACGCTGTTCG TCCCGGGCAG TGCCCAGATC GTCGCCGGCG ACCGGAAGCT CGGAAGGATC
GCCCTCCGGG TGACCCTCAC TGTCTGGGGA CTGGCGCTTG CGGGACTGGT GCTGCTGCTG
GTGAACCGCA CCCTCCTGAT CGGCATCCTC ACCAACACGG TGGCCTCACT CCTGATCATC
GTCGTCCTCA TTGCGCTCGC ACTCGGCTGG GCTGCGCTGT TCGTCAACAC CCTCAGGCTG
ATCCGGCCGG TCCTGCTGGC ACCCGGAATG CGGCCGGTCG TCGGCGTCGC ACTGGTCCTG
GCTATGCTGC TCAGCAGCGG GACACTCGGC TACGCCGCCT ACGTTCTGAA CGTGAGCCGG
AACGCGATCG GCAGCATCTT CTCGGCGGGC CCTGCAATCG ACCCCGTGGA CGGCCGCTAC
AACTTCCTGA TGATGGGCGG CGACGCCGGC GACGACCGCA CTGGCCGGCG CCCGGACAGC
CTCTCCGTCC TCAGCGTCGA CGCCAAGACA GGCCAGACAG CCATCATCTC GGTGCCGCGC
AACCTGCAGA ACGCACAGTT CAGCGAGGGT TCCCCCATGC GGCAGATCTA TCCGGACGGC
TACGACTGCG GCAACGAGTG CCTCATCAAC GCGATCAACA CCGAAGTGAC CAACGAGCAC
GCGGACCTCT ACCCCGGCGT CGCCGATCCC GGGGCCCAGG CAACCCTCGA GGCTGTCTCG
GGTACGCTCG GAATCACCGT CCAGGCCTAC GTCCTGGTGG ACATGGACGG CTTCGCCAAG
CTCATCGACG CCATGGGCGG CATCAAGATC AAGGCCGGCG GGTGGGTGCC GCTGAGCGGC
GACATGGTGG ACGAGGCCAA CGGCATCCAC GGAATGCCCC TCGGCTGGAT CCCGGCCGGT
GAACAGCACC TCAACGGCTA CCATGCCCTC TGGTACGGCC GCTCCCGGGA ATTCGTTGAC
GACTACGCAC GCATCCAGCG TCAGCAATGC GTCCAGCAGG CCATGCTGAA GCAGCTGGAC
CCCGCAACGC TGCTCTCCAA GTTCGAAGAC ATCGCCAACG CGGGCACCAA GGTGGTTGAC
TCCAACATTT CCGCGAGCCA GCTCGGCAGC TTCGTGGACC TGGCCATGAA GGCCAAGGGC
AAGGAAGTCA GCCGGCTGAC CATTGGACCG CCGGACTTCG ACGCATCGTT CTCCACGGTG
CCGGACTTCA ACCAGATCCA CGACAGGGTC GACCAGCTGC TGGCCGCGCA GTCCGAGTCG
GCGGGAGCCG CGGGTAATCC TGCCGGGGAG GACAGCATCG TGCAGGCCGG CGCGGCCGCG
GGCCCCCTGA TGGCGGCCGC ACCGGCGGCG CCCCTTACCC AGCCGGCACC GTCGCCGTCG
TCGTCGGACT TCACGCCAGT GACCACCACC CCCGACGGCG AGCCCATCAC GGAAGAGATG
CTCAACCAGT TCAAGCGTGA GGGCAACGAG CAAGCGATCC GCGACCTTGT GGCCACGAAC
GGCCAGTGCC GCCCGCTGTA A
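
The GC content reported for this gene can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as text in the 10-base blocks shown here; `gc_percent` is a hypothetical helper name, and the database may round or compute its figure differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First row of the gene sequence only; paste all rows for the full-gene value.
first_row = "ATGACCACCA GCCACTCCCG CCCCCAGGCG CCGCAACAGG CCCTGACGGA CCCCGTCCGC"
print(round(gc_percent(first_row), 1))
```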
 
Protein sequence
MTTSHSRPQA PQQALTDPVR NPANAPAPVR TKRAFVLLLL TLFVPGSAQI VAGDRKLGRI 
ALRVTLTVWG LALAGLVLLL VNRTLLIGIL TNTVASLLII VVLIALALGW AALFVNTLRL
IRPVLLAPGM RPVVGVALVL AMLLSSGTLG YAAYVLNVSR NAIGSIFSAG PAIDPVDGRY
NFLMMGGDAG DDRTGRRPDS LSVLSVDAKT GQTAIISVPR NLQNAQFSEG SPMRQIYPDG
YDCGNECLIN AINTEVTNEH ADLYPGVADP GAQATLEAVS GTLGITVQAY VLVDMDGFAK
LIDAMGGIKI KAGGWVPLSG DMVDEANGIH GMPLGWIPAG EQHLNGYHAL WYGRSREFVD
DYARIQRQQC VQQAMLKQLD PATLLSKFED IANAGTKVVD SNISASQLGS FVDLAMKAKG
KEVSRLTIGP PDFDASFSTV PDFNQIHDRV DQLLAAQSES AGAAGNPAGE DSIVQAGAAA
GPLMAAAPAA PLTQPAPSPS SSDFTPVTTT PDGEPITEEM LNQFKREGNE QAIRDLVATN
GQCRPL
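
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this with a hand-built codon table rather than a library call. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, dropping a trailing stop codon if present."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate_cds("ATGACCACCA GCCACTCCCG CCCCCAGGCG"))  # MTTSHSRPQA
```

The output matches the first ten residues of the protein sequence listed above.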