Gene Ajs_3236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_3236 
Symbol 
ID4671947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp3419322 
End bp3422078 
Gene Length2757 bp 
Protein Length918 aa 
Translation table11 
GC content68% 
IMG OID639840276 
Productputative transmembrane protein 
Protein accessionYP_987435 
Protein GI121595539 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID[TIGR03504] FimV C-terminal domain
[TIGR03505] FimV N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCTTT TTACATATTT TCACGCGCCT GGGCGGCTCG ATCACGACCA GAAAAATACC 
GCCCCCTCGG CACCATCCAT CTCTTCATTC GTCGCACACA TGCATCGCTG GAAATTTTCT
GTCCTGGCCA CCGCGGCCAT CCTCTCGGCT GGTCTTTACA CCACCGATGC AAGTGCTCTC
GCCCTGGGCC GCGTGAATGT CCAGTCGGCG CTTGGCGAGC CCCTGCGCGC GGAAATCGAA
CTGCCCCAGA TCACTGCTGC CGAGGCCGAA TCGCTGCGCG TGAGCACCGC CAGCCCTGAA
GTTTTCCGCA GCCAGGGCAT GGAATATTCC CCCGCTGCGC ATAGCGTGCA AGTGCAACTG
CACCGCCGCA CGAATGGCTC CATGGTGCTG CGCTTGAGCA GTACCCGCCC GGTCAACGAT
CCTTTCGTGG ATTTGGTTAT CGATGCGACC TGGAGTTCCG GCCATATCGT GCGTAGCTAC
ACGATGCTGT TCGACCCGCC GGCGAGCCGC CCGCAGGCGG CGGTGACCGC AGCCCCTCAG
GTAACGTCAC CCCGTGCCGC GGCAACTGCG CCGCGCGCCC CCGCCACCAC CGCGGCGCCC
GCGCCTGCCG CACGCCCAGC AACCCCTCCC GCACCCGCCG TAGCCGGCCG TCCCACCCCC
GCAGAGGCAC CCGCGTTCGC TGGCGGTGAC GAGATTCGCG TGCGCCCCGG GGACACGGCC
GGCCGCATCG CGGAAGCCCA TCGCCCCGCT GGCGTCTCGC TGGACCAGAT GCTGGTAGCA
ATGATGCGCG CCAATCCGGA CGCGTTCGTC AACAGCAATG TGAATCGTCT GCGTTCCGGC
GCAGTGCTGC AGATGCCTAG CGAAACGGAG GCGCAAGCCA CGGACGCCAC CGAAGCTCGC
AAGATCGTCG CCGCGCAAAG CCGCGATTTC AATGAATTCC GTCGCCGCAT GGCAGCGACA
GCGCCCAAGG CCGAGGTTGC AGCCGCAGAG CGCTCCGCGC GCGGCACTGT GCAGACCCAG
GTGGACGAGA GCAAGCCCGC CGCTGCCGGC CCCGATAAGC TCACTCTCTC CAAAGGCAGC
GTTCAGGCCC AGAAGACCGA GGAGCAGATG GCCCGCGACA AGCAGGCCGA ACAGAACAGC
GCCCGCATGG CAGAGCTGTC CAAGAACATC TCGGACCTGA ACCAGCTCAG CGGCGCCTCC
GCGCCAGCCG GCGCTGGCGC AGCTGCGCCA AGAGCCACGG CGCCTTCCGC CCCTGCCAGC
CAGCCGGCGG TGGCAGTACC CGCCGCCGGG GGGTTGCCCG CACCCGCTAC CGCGGCTGCA
AGCGCCGACA TGAGCACTGC TACGGCAGAA GCTGCCGCAC CAGCGCCCGA GGCCTCTGCC
GCCGAGGCGG CTGTTCCCGC CGAAGCAGCA TCCCAGGCTT CTGCCGCAGC ACCCGCTCCC
GCCGCCGTGG CCCCGCGTCC AGCCCCCCAG CCCGTTCCCT ACGAAGAGCC CAGCTTCCTG
GATGCCCTCA CGGAGGACCC GCTGCTGGCT GGAGGCGCGC TGGCGCTCGT ACTTGCGTTG
CTGGGCTATG GCGGCTACCG CGTGGTCCAA AGCCGCCGCA ACCAGGGTGC CCTGGACAGT
TCGTTCTCGG AAAGCAGCCT GCAGCCGGAT TCTTTCTTCG GCGCCAGTGG CGGCCAGCGT
GTGGACACGG CCAACAGCGA ACTCACGACC GGCTCCTCGT TCATGACCTA TTCGCCCAGC
CAACTGGATG CCGGTGGCGA TGTGGATCCG GTGGCGGAAG CCGACGTCTA CCTCGCCTAC
GGCCGTGACC TGCAGGCGGA GGAAATTCTC AAGGAGGCGC TGCGCCACCA TCCGGAGCGC
GTCTCCATCC CCGCCAAACT GGCCGAGATT TATGCCAAGC GCCAGGACCG CAAGGCGCTG
GAGTCCGTCG CCAACGACGT GTTCCGGCTC ACCAACGGCC AGGGGCCCGA CTGGACCCGC
GTGTCCGACC TGGGCCGTAC GCTCGACCCT GAAAATCCGC TGTACCAGCC CGGTGGCCGT
CCTGCCGTTG CCGCCAGCGC CACAGCGGCC GCTGCATCGA CCGCGGCCTT TGCAAGCACG
CTGGGCGCAG CAACTGCTCC CGCGACACCC ACGGGCGGAC CGGACTCTGT GCTGCCAGAC
CTCGATCTGG ACTTGGATCT CGACCTGCAC GAAGCACCGT CGGCCCCGGC CCCGGCGCCA
AGCACGTTTG CCATGGCCGC AGCCAACAAC ACCGCCGCAG CCGCGCCGGT AGTGTCGGCC
GCCCAGGCGC CCGTCCCGTC CCTGGATCTG GGCGACCTGG AACTACCCCA GGCCTCCTGG
GACGAGCCAG CGGTGACGCC GCAGGCGACG ACAGAGCCCG ACGTGCAAAG CGAGCCTCTT
CCGTTGAACC TGGACGATGA CTTGTCCCTG ATGGACTCCG GCGTCGCCCC CCTGACCAGC
CGCGACGCGA AGGCTGTCAC CTCGGAATCC CTGGAATTCG ACCTCGGCGA TCTGTCGCTG
GACCTTGACA CCCCCACGGC GGCCGCTCCC GTCGCGCCCT CGGTCGCCGC CAAGGCAGCG
TCCTCCGCTG CCGACGCGCT GCCGGACGAC CCTCTGGCCA CCAAGCTCGC ACTGGCGGAA
GAGTTCAACA CCATTGGCGA CAGCGAAGGC GCGCGGGCCC TGGTGGAAGA GGTCATCGCC
GAATCGTCCG GTGAGCTGAA GGCCCGCGCC CAGCGACTGC TCGCCGAACT GGGCTGA
 
Protein sequence
MVLFTYFHAP GRLDHDQKNT APSAPSISSF VAHMHRWKFS VLATAAILSA GLYTTDASAL 
ALGRVNVQSA LGEPLRAEIE LPQITAAEAE SLRVSTASPE VFRSQGMEYS PAAHSVQVQL
HRRTNGSMVL RLSSTRPVND PFVDLVIDAT WSSGHIVRSY TMLFDPPASR PQAAVTAAPQ
VTSPRAAATA PRAPATTAAP APAARPATPP APAVAGRPTP AEAPAFAGGD EIRVRPGDTA
GRIAEAHRPA GVSLDQMLVA MMRANPDAFV NSNVNRLRSG AVLQMPSETE AQATDATEAR
KIVAAQSRDF NEFRRRMAAT APKAEVAAAE RSARGTVQTQ VDESKPAAAG PDKLTLSKGS
VQAQKTEEQM ARDKQAEQNS ARMAELSKNI SDLNQLSGAS APAGAGAAAP RATAPSAPAS
QPAVAVPAAG GLPAPATAAA SADMSTATAE AAAPAPEASA AEAAVPAEAA SQASAAAPAP
AAVAPRPAPQ PVPYEEPSFL DALTEDPLLA GGALALVLAL LGYGGYRVVQ SRRNQGALDS
SFSESSLQPD SFFGASGGQR VDTANSELTT GSSFMTYSPS QLDAGGDVDP VAEADVYLAY
GRDLQAEEIL KEALRHHPER VSIPAKLAEI YAKRQDRKAL ESVANDVFRL TNGQGPDWTR
VSDLGRTLDP ENPLYQPGGR PAVAASATAA AASTAAFAST LGAATAPATP TGGPDSVLPD
LDLDLDLDLH EAPSAPAPAP STFAMAAANN TAAAAPVVSA AQAPVPSLDL GDLELPQASW
DEPAVTPQAT TEPDVQSEPL PLNLDDDLSL MDSGVAPLTS RDAKAVTSES LEFDLGDLSL
DLDTPTAAAP VAPSVAAKAA SSAADALPDD PLATKLALAE EFNTIGDSEG ARALVEEVIA
ESSGELKARA QRLLAELG