Gene Daro_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0866 
Symbol 
ID3569853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp937380 
End bp940490 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content57% 
IMG OID637679324 
Producthypothetical protein 
Protein accessionYP_284092 
Protein GI71906505 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID[TIGR03504] FimV C-terminal domain
[TIGR03505] FimV N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000189668 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00567814 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACGAAA CTAATCACAT CAAGTTCAAA AAACGTGCGG CAGCGCTCGC TGTCGCATCA 
TGCATGTCGC TCGCACCATG GTTTGCCGAG GCCGCAGGGC TAGGCAAATT GACCGTGCTA
TCCGCGTTGG GGCAGCCCCT GCGGGCAGAG CTCGATATCG GCGCCACCAA GGATGAGGTG
GCCGGTATGA CCGCTCGTCT TGCCCCGCAG GACGCCTTCA AGCAAGCTGG AGTCGATTTC
GCCACTGTCC TGCTTGATCT TCGTTTTTCG GTTGAAAAGC GCGCGAATGG ACAGTCTGTC
GTCAAGGTGT CATCGGCCAA GCCGATCAAT GAGCCTTTCC TCGACTTTCT TGTCGAATTG
AACTGGCCGG CTGGTCGTCT TGTTCGCGAA TATACCTTCC TGCTTGATCC GCCTGAAATT
GCTGCCAGCC AGGCGTCACG CCCGGTTGCG GACGCGCGGA TTGTCGAAAC GGTACGGGGT
GGAGGCGCTG CCGAAGACAG GCCAGTGTCG GTCAAGGCTG CTGCATCGCG TCCGGCACCT
GCGCCCAAGG TCGCGGCTGA GCCCAAGGCG GCAGCAGAAA ATAGTGGAAC CCGGGTTGTT
CGGCAGGGCG AGACGCTGCG CAAGATTGCC GATGAAAGCA AATACGACGG CGTGTCCCTT
GAGCAGATGC TGGTTGGCTT GTTCCAGAGT AATCCGGATG CCTTTATCGG GCAAAATGTT
AACCGTCTGA AATCAGGCGC TATTCTGAAT ATTCCGGAAA AATCTGCTGT TGAGGCAGTT
TCTCCAAAAG AAGCCAAGAA GATTTATGTT GCCCAGGCGG CGGACTGGAA TGCCTATCGC
CAGAAATTGG CAGCATCTAC AGCCAAGACG CCGGCCAAGG ACGAAACCTC GGGAGCCCAG
GCCAGTTCCG GCAAGATCAC GGCCAAGGTA GAGGAAAAGG CAGTGCCTGC CGAGCAGTCC
AAGGATCAGC TCAAGGTGGC ACGTGCCGAT GCTGCAGCCA AGGGCGCTGC CGCCGCTAAG
GCAGCTGAAG CGGCTGACCA GATTGCCAAG GATAAGGCGC TGAAGGAAGC GCAGGATCGG
ATGGCAACTC TTGAAAAGAA TGTGAATGAG TTGCAGAAGC TTCTGGAAAT GAAAAACCAG
AAGCTGGCTG AACTTCAGCA ACCTCCGGTC AAGAAGGAAG AGCCCAAGGC GCCTGAGGTG
GTCAAGCCAG TCGAGCCGCC AAAGCCTGTC GAAATTGCCA AGCCAGCCCC AGCGGTTGAG
GAGGCGCCGA AACCGGTTGA GCCGCCGAAG GTGGTTGAAG AGACCAAGCC AGTTGAGCCG
GTCAAGCCGG TCGAACCGCC TAAGCCGGAG GCCCCCAAGC TTGAGGAGAA ACCGAAGGTG
GTCGCTCCGC CGCCGGCTCC GGCGCCGGTT GTCGAGGAGT CGTTGCTTGA TGACCCGCTG
CCGCTGGTTG GCGGTGGTGG TATTCTGGCT TTGCTGGCTG GCTATTTCCT GTTCAAGCGT
CGTCGTACGC AGAGCTCCGC CTTGGAAACA ACCGCTGCAC CGATGCCATC CAGTCTTGGT
CCAAATTCCG TGTTCCGGAT GACTGGTGGG CAAAGCGTCG ATACGGGGAA TACGCCGCCA
CAGACCGGTG AGTTCAGTCA GACAGGGCCC GGCACCATTG ATACCGATGA AGTTGATCCG
GTGGCTGAGG CGGACGTCTA CATGGCCTAT GGTCGCGATA CCCAGGCTGA AGAAATTCTC
CTCGAGGCCT TGCAGAAGGA TCCGCAACGG ACGGCGATTC ACGCCAAATT GCTCGAAATT
TACGCCAACC GGCACAGCCT GAAACAGTTT GAAACCTTGG CCAGCGAGCT GTATGCGCAG
ACCGCTGGTG TTGGGCCTGA TTGGGAAAAA GTTGCCGCTT TGGGCGTAGG CCTTGATCCA
AGCAATCCGC TATATTCCAG CTCACGTACG TCGGATGTCT CTGCCGCAAC GATGGTGGCT
CTCCCTGTAG AGGAACTGGA GCTGCCGGAG GCAGTGCCTG AAGCGTTATC GCCGCTGGTG
GGAGGGGTTG AGCCGCTGCC GGAGATTGGT TTGGCCGATG AATTCGAGCG GGATACGTTG
GTCCTGCCGA AGGAAACCGA TGGCCATGAG GCGCTGGGCG AGGATGTTTC CGTGGCTGAT
GAGCTGGCAT CGGGTTCTGA TGCCATGACG CTTGACTTTG ATCTTGGCGA ACAGACTATC
GCTCCCGAGG TCAAGGGCGT GGCTGAGACC TCGCTGGCCG ATGTCAGTGC GGATCTCGAA
TCGCCGATCA GCATCGATTC CAGTGCACTT GATTTCGATC TCGGTAACGA TGTTCAGGAG
CCCGAGATTG TCGCGACGCT CGTCGGTGAC CAAGGTGTAT TTGGTGATCT TGCGACCGGC
GTAGATCTTG ATTTCAACGC CGCCGAAAAG CAGTTGTTGG CGCCAGAGCA ATCGGTCATT
CCAACCCCGG ACTTCTCTCC GGAAGGTACC TTGGTCATGC CGTCGGCCAC TGACAACGAA
GACCTTGATG TTGGCCTGGG TACTTGGGTT GGTGCTGATG GCGAGCCCGG TATCTCGACG
AATGAGGAGG CTTCGGCCGG CAGTGGCGGA TCTGTTGATA CGGATTCGCT GATGTCTCAG
ACCATTGTCA ATCCGATGGC TGGAACGGAT ACGCTGCTTA GTTCTGATAT CCTGAGCTTT
GGTGGAGAAA CCGATCATCC TCAACTGTCG AGCACTGTGG TTAACTCGGG TGTGGTCGAT
GCTGACTCTC TCGAATTTGA CGTCAAGCTG ACTGATTCCA TGTTCCTTGG GCAACCGATG
ATTCCGCCTG AGTTCGATAT CGGATCAATC AATCTTGATT TGGCGGCCGA ACCCGCAGAA
CCTTCTGTTT CGCCAGTCGA AGTTCCAGCC GTGGCAGAAG CTCCGCTCGC TGCTACAGCG
ACCCATGATG CGCAGTGGGA AGAGGTTAAT ACCAAACTTG ATCTGGCCAA GGCCTATGAA
GAAATGGGTG ATCTCGAGGG AGCTCGCGAA CTTTTGCAGG AAGTGGTAGG CGAAGGGTCG
GTTGATCTGG TTGAGCAGGC ACGTACGATA CTCGGTCGAA TAGGCGGGTA G
 
Protein sequence
MNETNHIKFK KRAAALAVAS CMSLAPWFAE AAGLGKLTVL SALGQPLRAE LDIGATKDEV 
AGMTARLAPQ DAFKQAGVDF ATVLLDLRFS VEKRANGQSV VKVSSAKPIN EPFLDFLVEL
NWPAGRLVRE YTFLLDPPEI AASQASRPVA DARIVETVRG GGAAEDRPVS VKAAASRPAP
APKVAAEPKA AAENSGTRVV RQGETLRKIA DESKYDGVSL EQMLVGLFQS NPDAFIGQNV
NRLKSGAILN IPEKSAVEAV SPKEAKKIYV AQAADWNAYR QKLAASTAKT PAKDETSGAQ
ASSGKITAKV EEKAVPAEQS KDQLKVARAD AAAKGAAAAK AAEAADQIAK DKALKEAQDR
MATLEKNVNE LQKLLEMKNQ KLAELQQPPV KKEEPKAPEV VKPVEPPKPV EIAKPAPAVE
EAPKPVEPPK VVEETKPVEP VKPVEPPKPE APKLEEKPKV VAPPPAPAPV VEESLLDDPL
PLVGGGGILA LLAGYFLFKR RRTQSSALET TAAPMPSSLG PNSVFRMTGG QSVDTGNTPP
QTGEFSQTGP GTIDTDEVDP VAEADVYMAY GRDTQAEEIL LEALQKDPQR TAIHAKLLEI
YANRHSLKQF ETLASELYAQ TAGVGPDWEK VAALGVGLDP SNPLYSSSRT SDVSAATMVA
LPVEELELPE AVPEALSPLV GGVEPLPEIG LADEFERDTL VLPKETDGHE ALGEDVSVAD
ELASGSDAMT LDFDLGEQTI APEVKGVAET SLADVSADLE SPISIDSSAL DFDLGNDVQE
PEIVATLVGD QGVFGDLATG VDLDFNAAEK QLLAPEQSVI PTPDFSPEGT LVMPSATDNE
DLDVGLGTWV GADGEPGIST NEEASAGSGG SVDTDSLMSQ TIVNPMAGTD TLLSSDILSF
GGETDHPQLS STVVNSGVVD ADSLEFDVKL TDSMFLGQPM IPPEFDIGSI NLDLAAEPAE
PSVSPVEVPA VAEAPLAATA THDAQWEEVN TKLDLAKAYE EMGDLEGARE LLQEVVGEGS
VDLVEQARTI LGRIGG