Gene Veis_4004 details

Gene Information

Locus tag: Veis_4004
Symbol:
ID: 4694641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4394418
End bp: 4395437
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 69%
IMG OID: 639851752
Product: hypothetical protein
Protein accession: YP_998728
Protein GI: 121610921
COG category: [S] Function unknown
COG ID: [COG3181] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
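
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the record; the 1-based inclusive coordinate
# convention is an assumption, not something the record states.
START_BP = 4394418
END_BP = 4395437


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1020 339
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.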


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.506172
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0371737
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGCAT TCCGATCTTC CTCCCTCCAC CACCATCACC ATCACCACCA CCGGCGGCGG 
CTGTTGGCCG CACTGGGCAC CGCCGCGTGT CTTGCGGCCG CTGCCCCGGC AGGCCAGGCG
CAAGGCTATC CGAACAAGCC GGTCCGGCTG GTCGTGGCGT ACCCGCCGGG CGGCGCAACG
GATATCGTGG CGCGCGTGCT GGCGCAAAAG CTGTCGGAGC AGACCGGCCA GCAATTCATC
GTCGACAACC GGCCCGGCGC GGGCGGCAAC ACGGGTGCCG AGTGGGCGGC GCGCAGCGCG
CCCGATGGCT ACACGCTGGT GCTGGCAACC ACTGCGCATG CGATCAGCCC CGCGCTCTTC
AAAAACCTCG GCTACAAGCT CGACAAAGAC TTTGCGCCCG TGTCGCAGCT CACCAGCGGC
CCGCTCGTGA TCGTGGCGCA CCCCGGACTG CCAGCGAACG ACGTGACCGA GCTCATCGCG
CTGGCCAAGG CCAGGCCGGG CGTGCTCAAT TTCGCGTCAT CGGGCAACGG CCAGTCGACC
CACCTCGCGG CCGAACTGTT CGCCTCGATG GCCGGCGTGA AGATGGCGCA CATCCCGTAC
AAGGGCAGCG CCCCGGCGCT GACCGACGTG ATGGGCGGCC AGGCGCAGCT GATGTTCGAC
ACCATCCTCT CGGCCATGCC GCAGGTGAAG GCCGGCAAGC TCAAGGCGCT GGCCGTGACC
AGCGCCAAGC GCTCGGGCGC GGCGCCCGAA CTGCCGACCG TGGCCGAGTC CGGCCTGCCG
GGCTACGAGG CCATCGCCTG GAATGGCCTG CTGGCGCCGG CCGGCACGCC GCCGGAGGTG
ATCGCGCGCC TGAACGCCGA ACTGAAGAAG GCGCTGGCGC TGCCCGAAGT GAAGGACAGG
TTCGAGGCCC AGGGCTTTGC CGCCGCGTGG AACACGCCCG AGGCTTTCGG CGACTTCATG
AACGCGCAGG TCAAGAAATG GGCGCAGGTG GTGCAGGTGT CGGGCGCCAC GCTGGACTAG
 
Protein sequence
MHAFRSSSLH HHHHHHHRRR LLAALGTAAC LAAAAPAGQA QGYPNKPVRL VVAYPPGGAT 
DIVARVLAQK LSEQTGQQFI VDNRPGAGGN TGAEWAARSA PDGYTLVLAT TAHAISPALF
KNLGYKLDKD FAPVSQLTSG PLVIVAHPGL PANDVTELIA LAKARPGVLN FASSGNGQST
HLAAELFASM AGVKMAHIPY KGSAPALTDV MGGQAQLMFD TILSAMPQVK AGKLKALAVT
SAKRSGAAPE LPTVAESGLP GYEAIAWNGL LAPAGTPPEV IARLNAELKK ALALPEVKDR
FEAQGFAAAW NTPEAFGDFM NAQVKKWAQV VQVSGATLD
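
The relationship between the two sequence blocks can be sketched in Python. This is a minimal illustration, not the pipeline that produced the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown in this record.
first_line = "ATGCATGCAT TCCGATCTTC CTCCCTCCAC CACCATCACC ATCACCACCA CCGGCGGCGG"
print(translate(first_line))  # MHAFRSSSLHHHHHHHHRRR
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would apply the same check to the whole CDS and to the reported GC content.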