Gene Daro_3944 details


Gene Information

Locus tag: Daro_3944
Symbol:
ID: 3567482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4240583
End bp: 4242652
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 62%
IMG OID: 637682418
Product: TonB-dependent receptor
Protein accession: YP_287142
Protein GI: 71909555
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
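
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 4240583
end_bp = 4242652
stated_gene_length_bp = 2070
stated_protein_length_aa = 689


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(length_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp == stated_gene_length_bp)     # coordinates agree with Gene Length
print(length_aa == stated_protein_length_aa)  # codon count agrees with Protein Length
```

Under these assumptions, 2070 bp corresponds to 690 codons, of which 689 encode amino acids and one is the stop codon.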


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value: 0.677577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTAA GAAAAAAGCT AATAAGTTCC TATTTTCTGG CGGCCGTGAG CAGCGTCGGA 
GCACATGAGT CGGCTGAGCT TTCTACGGTC GAAGTGCGGG CCAAGGCCGA AAATCTGGAA
GGCATCGCCG CGTCCGGCAG TGAAGGTGTG GTTTCCAGCC AGCGCCTGGC GGCCGTGCCC
ATCCTGCGCC CCGGCGAGGC GCTGGAAATG GTGCCCGGCC TGATCGTTAC CCAGCACGCC
GGTGACGGCA AGGCCAACCA GTATTTTCTG CGCGGCTTCA ACCTCGACCA TGGTACCGAT
TTCGCCACTT ACGTCGGTGG CGTCCCGGTC AACATGCCGA CCCATGCTCA TGGCCAGGGC
TACACGGATC TCAATTTCCT GATTCCCGAG CTGGTCGATC GCATCAGCTT CCGCAAGGGG
CCGTACTTCG CTGAGGAAGG CGATTTCTCA TCGGCTGGCG CAGCGCATAT CGACTATTTC
CGCCAGATGG ATGCGTCGCT GGCGCAAATC ACGCTGGGCC AGAAGGGCTA TGTGCGCAAC
CTGTTGGCTG GCTCGCCGGA AGTGGGCCGC GGCAATCTGC TCTATGGTCT GGAATTGTTC
CATAACGACG GACCGTGGGA AGTGGCCGAG AATTACCGCA AGTTTAATGG CGTGCTGCGC
TATAGCCAGG GAACACGCAA CGACGGCTTT TCGCTGACCG GCATGGCCTA CCGGGGCCAA
TGGACGTCCA CCGACCAAAT TGCCCAGCGC GCCATCGACC GCGGGCAGGT CGGCCGCTTT
GGCACCCTCG ATTCGACGAC CGGTGGCGAG ACCAGTCGCT ACAGCCTGGC CGGTGAATGG
GCCAGGCGCT GGGCCAATGC CCAGACCAAG GCCAACGTCT GGTGGCTGAA GTCCAGCCTC
GATCTGTGGT CCAATTTTCA ATATTGCCTG AACGATGTCG CCCACAGTGG CACCTGTGAC
ACCGGCGACC AGTTCAAGCA GGGCGAGCGC CGTCAGTCGG GGGGCTTTGC GCTGTCGCAT
GCGATGTTCG ATCGCTGGGG CGCTTTCGAG GTCGAGAACA GCATCGGCCT GCAGAGCCGG
ATCGACCGTC TTAACCCGGT TGGTCTCTAC GCCACCTCGG CCCGGCAGAC CGTCGGCACT
GTCCGCGAAG ACAAGGTGAC CCAGCGCAGT CTGACGCTGT GGGGGCAAAA CGAAACGCGG
TGGACCGAGT GGTTCCGTTC GGTCCAGGGC TTGCGTGCCG ATGCCTACGA TTTCGACGTC
GAATCGAGCC TGGCTGCTAA TTCCGGCAAG GCCAGCGACC AGATGGTGAC GCCCAAACTG
GCGCTGATTT TCGGCCCGTG GCAGAAAACC GAGCTCTACC TGAATTACGG CCACGGCTTC
CATTCAAACG ATGCGCGCGG GACGACGATC AAGGTCGATC CAGCTGACGG CACGACGCCA
GTACAGCGCG TCAAGCCGTT GGTCCGCACC AAGGGTTACG AAGTGGGTGC GCGCAGCGAA
CCGGTCGCCG GCTGGCAATC GACGCTTGCC TTGTGGCAAC TCGATGCCGC CTCCGAACTG
CTCTTTGTTG GCGATGCCGG CACCACCGAA CCATCACGCC CGTCGCGCCG CTATGGCCTT
GAGTGGACCA ACCTCTATGT ACTCTCCGAT TGGCTGGCGA TCGATGCCGA CCTCGCCTGG
TCGCATGCCC GTTTCCGCGA TCAAGATCCG ACGGTCGGCG ACTACATTCC CGGTGCGGTG
GCGACGACTG CCAATATCGG CCTGACGTTC GATCACCTCG GCCCGTGGTT CGGGGCCTTG
CGCCTGCGCT ATTTCGGGCC GCGTCCGCTG ATCGAAGACA ACTCCGTGCG CTCCGGCAGT
TCGGCACTGA CCAACCTGCG TACCGGCTAC AAGATCGACC AGCGTACCCA GTTGACCCTC
GACGTCTACA ACCTGTTCGA TCGCAAGGAG AACGACATCG AATACTGGTA CGACTCGCAA
CTGCGGGGAG AGGGCACCTC GGCCAGCGAT CGCCATATTC ACCCTGCCGA ACCGCGTAGC
CTGCGCCTGA CCATCTCCCA TCGCTTCTAA
 
Protein sequence
MSVRKKLISS YFLAAVSSVG AHESAELSTV EVRAKAENLE GIAASGSEGV VSSQRLAAVP 
ILRPGEALEM VPGLIVTQHA GDGKANQYFL RGFNLDHGTD FATYVGGVPV NMPTHAHGQG
YTDLNFLIPE LVDRISFRKG PYFAEEGDFS SAGAAHIDYF RQMDASLAQI TLGQKGYVRN
LLAGSPEVGR GNLLYGLELF HNDGPWEVAE NYRKFNGVLR YSQGTRNDGF SLTGMAYRGQ
WTSTDQIAQR AIDRGQVGRF GTLDSTTGGE TSRYSLAGEW ARRWANAQTK ANVWWLKSSL
DLWSNFQYCL NDVAHSGTCD TGDQFKQGER RQSGGFALSH AMFDRWGAFE VENSIGLQSR
IDRLNPVGLY ATSARQTVGT VREDKVTQRS LTLWGQNETR WTEWFRSVQG LRADAYDFDV
ESSLAANSGK ASDQMVTPKL ALIFGPWQKT ELYLNYGHGF HSNDARGTTI KVDPADGTTP
VQRVKPLVRT KGYEVGARSE PVAGWQSTLA LWQLDAASEL LFVGDAGTTE PSRPSRRYGL
EWTNLYVLSD WLAIDADLAW SHARFRDQDP TVGDYIPGAV ATTANIGLTF DHLGPWFGAL
RLRYFGPRPL IEDNSVRSGS SALTNLRTGY KIDQRTQLTL DVYNLFDRKE NDIEYWYDSQ
LRGEGTSASD RHIHPAEPRS LRLTISHRF
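
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical. The example input is the first row of the gene sequence and its last four codons, so the full sequence would need to be pasted in to check the whole record.

```python
# Codon table built in the conventional TCAG order; the amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


first_row = "ATGAGCGTAA GAAAAAAGCT AATAAGTTCC"  # first 30 bases of the gene
last_codons = "CATCGCTTCTAA"                    # final 12 bases of the gene

print(translate(first_row))    # MSVRKKLISS
print(translate(last_codons))  # HRF
```

The two outputs match the first ten and last three residues of the protein sequence above. Calling `gc_percent` on the full gene sequence would give a value to compare against the GC content field.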