Gene Veis_4398 details


Gene Information

Locus tag: Veis_4398
Symbol:
ID: 4694421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4845634
End bp: 4847154
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 65%
IMG OID: 639852147
Product: flagellin domain-containing protein
Protein accession: YP_999119
Protein GI: 121611312
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
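
The reported gene and protein lengths can be cross-checked against the coordinates above. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but the reported length agrees with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Coordinates copied from the record above.
start_bp = 4845634
end_bp = 4847154

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1521 bp
print(protein_length, "aa")   # 506 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.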


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.308154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.749909
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCCA CGATCAACAC CAATATTGCG TCGCTGACCG CACAGCGCAA TCTGGGCCTG 
AGCCAGTCGT CGCTGAACAC CTCGATACAG CGCCTGTCCT CGGGCCTGCG CATCAACAGC
GCCAAGGACG ATGCGGCGGG GCTGGCGATT TCCGAGCGCT TTACGAGCCA GATTCGCGGC
CTGGACCAGG CGGCGCGCAA CGCCAATGAC GGCATCTCGC TGGCCCAGGT CACCGAAGGG
GCGATGAAGT CGGCCAGCGA CATCTTGCAG CGGGTGCGCG AGTTGGCGGT GCAGTCGGCC
AATGCGTCCA ACAGCCCCGG CGACCGCCAG GCGCTGAACC AGGAAGTGGG CCAGTTGGTC
AGCGAGCTCG ATCGCATTGC GCAAACCACT GAATTCAACG GCGCCAAATT GCTCGACGGC
AGCTTTGGCA CGCAGCAGTT CCAGGTGGGC GCCAATGCCA ACCAGACCAT CGTCGCGGCC
ACGGCCAATC TGCGCACCGG CGTGTATGGC AACAACCAAA ACACGGCGGC CAATGGCGCC
GGCGCCGATG CCAACATGGG CGCCGATGCC GCCTGGGGGA GCAACGGTGT CGGCACCGGG
GCATTGGCCA TCAGCGGCGC GCTCGGCTCG GCCAGCATCG GCATCGAGGC GAACCACACG
GCCAAGGCCA TGGCCGACGC CATCAACCTG AAAACCGCCG ATACCGGCGT CACGGCCTCG
GCGCGCACCG AGGTGCAGTT GTCCTTTTCT GCGCCCGGCG CTTACACCTT CCAGTTGCGC
AGCGAAAACC GCCCGAACCC GCCCGCGCTC GGGCAGCCGA TGGCGTTCCA TGTGACGGCG
ACCGGTACGA TCGACGGCTT GTCGAACGCG ATTGCCGCGA TCAATGAGCA ATCGGCCAAG
ACCGGGGTCA CCGCCGCGCT GAACCCGGGC GCCACCGGCA TCGTGCTGAC CAACACCACG
GGGCAGGACA TCGGCCTGTA CAAAAGCGCC AGTGACAGCG GCAATGCGGG CACGATCACT
GTCCAAAAGC AAAACGCCGA TGGCCTGCCC GCAGGCAGCG CGGGCGCCTT GGCGGCCGCC
GCCGGCGTTG GCAATGCCAC CGTCAGCGGG TATGTGGTGC TCGATGCGAA CAAGCCCTTT
TCCACGACCG TCACCACCAC GAACGCTTTC AACACCACGG CTCCCGCCGA CTCCGCCTCC
TCGCTGCAAG AAGTGGCCGG CCTGGATGTG ACGACATTCA AAAATGCGAC CGAGGCCCTC
AAGACCGTGG ACTCTGCGCT GTCGTTCATC AATGGCGAGC GCGCCAAGCT CGGCGCGTTG
CAGTCGCGCT TCGAGAGCAC CATCGCCTCG CTGAACATCA CCTCGGAAAA CCTGTCGGCA
TCGCGCTCGC GCATCCTCGA CGCCGACTTC GCCACCGAGA CGGCGAACCT GTCGCGCGCC
CAAATCCTGC AACAGGCCGG CACCGCGATG GTGGCCCAGG CGAACCAGAT TCCGCAAGGC
GTGCTCAAGC TGTTGCAGTA G
 
Protein sequence
MAATINTNIA SLTAQRNLGL SQSSLNTSIQ RLSSGLRINS AKDDAAGLAI SERFTSQIRG 
LDQAARNAND GISLAQVTEG AMKSASDILQ RVRELAVQSA NASNSPGDRQ ALNQEVGQLV
SELDRIAQTT EFNGAKLLDG SFGTQQFQVG ANANQTIVAA TANLRTGVYG NNQNTAANGA
GADANMGADA AWGSNGVGTG ALAISGALGS ASIGIEANHT AKAMADAINL KTADTGVTAS
ARTEVQLSFS APGAYTFQLR SENRPNPPAL GQPMAFHVTA TGTIDGLSNA IAAINEQSAK
TGVTAALNPG ATGIVLTNTT GQDIGLYKSA SDSGNAGTIT VQKQNADGLP AGSAGALAAA
AGVGNATVSG YVVLDANKPF STTVTTTNAF NTTAPADSAS SLQEVAGLDV TTFKNATEAL
KTVDSALSFI NGERAKLGAL QSRFESTIAS LNITSENLSA SRSRILDADF ATETANLSRA
QILQQAGTAM VAQANQIPQG VLKLLQ
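
The protein sequence above follows from the gene sequence by translation with table 11. The sketch below shows one way to spot-check that. It is not the tool used to produce this record. The codon table is built from the NCBI amino-acid string shared by translation tables 1 and 11 (the two differ only in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the matching amino acids
# (NCBI layout; the amino-acid string is the same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the final row of the gene sequence listed above.
first_block = translate("ATGGCTGCCA CGATCAACAC CAATATTGCG")
last_block = translate("GTGCTCAAGC TGTTGCAGTA G")

print(first_block)  # MAATINTNIA
print(last_block)   # VLKLLQ
```

Each full row of the gene sequence holds 60 bases, a multiple of three, so every row starts in frame and can be translated on its own for comparison with the protein listing.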