Gene Meso_4000 details

Gene Information

Locus tag: Meso_4000
Symbol:
ID: 4181147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chelativorans sp. BNC1
Kingdom: Bacteria
Replicon accession: NC_008254
Strand:
Start bp: 4306380
End bp: 4307882
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 62%
IMG OID: 638069896
Product: type II secretion system protein E
Protein accession: YP_676532
Protein GI: 110636324
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
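
The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers written for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information record for Meso_4000.
length_bp = gene_length(4306380, 4307882)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1503 500
```

Both results agree with the listed Gene Length (1503 bp) and Protein Length (500 aa).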


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.737782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGGGA GAAGAGGAAC AGGCGACACC ACAGGCACAG TCAGGTCACG CCTCGTCGTT 
CAGCCGGCGC TGGCTCACGC CGAAGCAGAT ATCTCTGCTC CCAAGCCCCG TCCGCAGCAG
CCGGATGCAG CGGGCGCAAT TGTTGCTCCG CCTGCCGCCT CCAGTCAGGA GAAACCCGCG
CCCGCTCCTG CCCAGGCGCC GGCTCGCCCG GCGGCTCACA GCGACAGCTA TTACGACACC
AAGGGCCAGG TTTTTTCCGC CCTCATCGAT ACGATCGATC TTTCCCAACT GGCGAAGCTC
GACGCCGACA GCGCCCGGGA AGAAATACGC GACATCGTCA ACGACATCAT CGCAATCAAA
AATCTTGCGA TGTCGATATC CGAGCAGGAA GAGCTGCTGG AGGACATCTG CAACGACGTG
CTGGGCTTTG GGCCGCTGGA GCCGCTGCTC GCGCGCGATG AAATCGCCGA TATAATGGTG
AACGGCGCGC GAAACGTCTA CATTGAGGTG AACGGGAAAG TCGAGCAGAC GAATATCCGT
TTCCGCGACA ATCAGCAGCT TCTCAACATC TGTCAGCGCA TCGTAAGCCA GGTTGGCCGG
CGGGTGGACG AGTCGAGCCC GATCTGCGAT GCGCGCCTGC CCGACGGCTC CCGTGTCAAC
GTAATCGCGC CGCCGCTGGC GCTCGACGGC CCGACGCTGA CCATCCGCAA GTTCCGCAAG
GACAAGCTGA CGCTCGATCA GCTTGTCCGG CACGGCAGCA TCTCGCCGGA GGGCGCGGAG
GTGCTGAAAA TTATCGGCAG AGTGCGCTGC AACGTCGTCA TTTCCGGTGG CACGGGCTCC
GGCAAGACCA CGTTGCTCAA CTGTCTCACG AATTTCATAG ACCGGGATGA GCGTGTCATA
ACCTGCGAGG ATTCGGCCGA ACTTCAACTG CAGCAGCCTC ATGTAGTGCG GCTCGAAACC
CGCCCGCCCA ACCTCGAAGG CGAAGGCGAG GTGACGATGC GCGATCTCGT GCGCAACTGC
TTGCGTATGC GGCCCGAACG TATCATCGTG GGTGAGGTGC GCGGGCCCGA GGTGTTCGAC
CTGCTCCAGG CCATGAACAC GGGTCATGAC GGGTCCATGG GCACGATCCA TTCCAACAGC
CCGCGTGAAT GCCTGAACCG CATCGAATCC ATGGTGGCCA TGGGCGGCTT TTCCCTGCCG
CAGAAGACTG TTCGCGAAAT CATCGTCGGC TCTATCGACG TCATCATTCA GGCCGCGCGC
CTGCGTGACG GCTCCCGTCA CATCACCCAC ATAAGCGAGG TGCTGGGTAT GGAAGGGGAT
GTGATCGTCA CTCAAGACCT GGTGGTCTTC GACATAAAGG GGGAGGACGC GTCGGGCCGC
ATCATCGGCA AGCACGTCTC CACCGGCATC GGCCGGCCTG CTTTCTGGGA ACGCGCGCTC
TATTATGGCG AGGAGGCGCG GCTCGCAGCC GCGCTCGAAG CGATGGAAAA GCAGGCTCTC
TGA
 
Protein sequence
MFGRRGTGDT TGTVRSRLVV QPALAHAEAD ISAPKPRPQQ PDAAGAIVAP PAASSQEKPA 
PAPAQAPARP AAHSDSYYDT KGQVFSALID TIDLSQLAKL DADSAREEIR DIVNDIIAIK
NLAMSISEQE ELLEDICNDV LGFGPLEPLL ARDEIADIMV NGARNVYIEV NGKVEQTNIR
FRDNQQLLNI CQRIVSQVGR RVDESSPICD ARLPDGSRVN VIAPPLALDG PTLTIRKFRK
DKLTLDQLVR HGSISPEGAE VLKIIGRVRC NVVISGGTGS GKTTLLNCLT NFIDRDERVI
TCEDSAELQL QQPHVVRLET RPPNLEGEGE VTMRDLVRNC LRMRPERIIV GEVRGPEVFD
LLQAMNTGHD GSMGTIHSNS PRECLNRIES MVAMGGFSLP QKTVREIIVG SIDVIIQAAR
LRDGSRHITH ISEVLGMEGD VIVTQDLVVF DIKGEDASGR IIGKHVSTGI GRPAFWERAL
YYGEEARLAA ALEAMEKQAL
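
The relationship between the two sequences can be spot-checked by translating the gene sequence. The following is a minimal sketch, not the tool that produced this record. It builds the codon map from the amino-acid string for NCBI translation table 11, whose amino-acid assignments match the standard code, and stops at the first stop codon. It does not handle alternative start codons, which is acceptable here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, introduced only for this example.

```python
from itertools import product

# Codon order follows the NCBI convention: each base position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGTTCGGGA GAAGAGGAAC AGGCGACACC"))  # MFGRRGTGDT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` allows the same comparison over all 500 residues.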