Gene Veis_4433 details


Gene Information

Locus tag: Veis_4433
Symbol:
ID: 4694363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4892347
End bp: 4895646
Gene Length: 3300 bp
Protein Length: 1099 aa
Translation table: 11
GC content: 68%
IMG OID: 639852182
Product: outer membrane protein
Protein accession: YP_999154
Protein GI: 121611347
COG category:
COG ID:
TIGRFAM ID: [TIGR02059] cyanobacterial long protein repeat
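
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4892347, 4895646)
print(length_bp, protein_length(length_bp))  # prints: 3300 1099
```

Both values agree with the Gene Length and Protein Length fields of this record.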


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.200125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCG ATGTGGTGGA CGGCGGACAG GACGCCGGCA CGTCCGGCGC CAGCACGCAG 
GGCAATCGCG CAGTCAGCTT CGCTCCGACA ATGACAATCG ATGTGGACGG CGGACTGGAC
CAAACGGCCC CGGTGCTGAG CACCGGCGCT GACGCCCCGA AGGTCAGCGG CGACAAGTTG
GTGCTCCGCT ACAGCGAAGC GAACCTACTG GGCACGGCCC ACCCCGATCC GGCTGCGCAC
GCCTTTGCGG TGACGGTCAA TGGCAGGACC AACCCCGTCA CCGACGTCGC CGTCGATTCC
CAGACCAAGA CCGTCACGCT GACGCTGACC CGCCCCGTGA CCGGCGGCGA GGATGTGAGC
GTCACCTACA CCAAGCCCAC TGACAGCAGC AAGGCCATAC AGGACGCGGC CGGCAACATC
GCCGACGACA TCACTGCGCC GCTCCGTGTG AACAGCGGAA CAGACACCAC GGCCCCGACA
CTGGTCGCCG ATACCGTGGC CCCGCCGCGT ATCGGCGGCG GTGGCAAGGA ACTGGTGCTC
ACCTACGAAG ACGCGAACTA CCTGAAAGCT GTCCAGGTGC CCAAGACAGC CTTTGCGGTG
CTCGTCAATG GCACGCCCAA CGATGTCACC GCAGTCAAGG TGGATGGCGG GCAGGACAAG
ACCGTCACAC TGATGCTGAG CCGCTCCGTG CCCTCCGGTG CGGAGGTGAG CATCGCCTAC
ACCCCGCCCG GGGAGAGCAC CCAGGCCATA CAGGACAGGC AGGGCAATCC CGCAGCCGGC
ATCGTTGAGC CGATGCATGT GGACAGCGGA CGGGACCAAA CGGCCCCGAT GCTGGTCTTC
GGCGGCATCT CGACGGTCAG CGGCGACAAG TTGGTGCTCG GCTACAGCGA AGCGAACCTA
CTGGGCACGG CCCACCCCGA TCCGGCTGAG GCTGCGCGCG CCTTTGCGGT GACGGTCAAT
GGCAGGACCA ACCCCGTCAC CGGCGTCGCC GTCGATTCCC AGGACAAGAC GGTCACGCTG
ACGCTGACCC GTCCCGTGAC CGGCGGCGAG GATGTGAGAG TCGCCTACAC CAAGCCCGCT
GACAGCAGCA AGGCCATACA GGACGCGGCC GGCAACTTCG CCAAAGACAT CACTGCGCCG
TTCCCTGTGG ACAGCGGAAG AGACACCACG GCCCCGACAC TGGTCACCGA TACCGTGGCC
CCGCCGCGTA TCGGCGGCGG CAAGGAACTG GTGCTCACCT ACGAAGAAGC GAACTACCTG
AAAGCTGTCC AGGTGCCCAA GACAGCCTTT GCGGTGCTCG TCAATGGCAC GCCCAACGAT
GTCACCGCAG TCAAGGTGGA TGACGGGCAG GACAAGACCG TCACACTGAC GCTGAGCCGC
TCCGTGCCCT CCGGTGCGGA TGTGAGCATC GCCTACACCC CGCCCGGGGA GAGCACCGAG
GCCATACAGG ACTGGCAGGG CAATCCCGCA GCCGGCATCC GTGAGCCGAT GCATGTGGAC
AGCGGACGGG ACCAAACGGC CCCGGTGCTG GTCTTCGGCG GCATCTCGAC GGTCAGCGGC
GACAAGTTGG TGCTCAGCTA CGACGAAGAT AACAACCTGG ACACAGTCCA CAAGGCGCTG
GCTGGATCCT TTACGGTGCT CGTCGACGGA TCGCCCAACC CCGTCACCGA GGTCAGCGTG
GCAAACGCAC CGGCCAAGAC GGTCACGCTG ACGCTGACCA AGGCCGTGGC CAGAGGTGTG
CCGGTGACCG TCGCCTACCA GGCCACAGCC GACAACGCGA TACAGGACAT GGCAGGCAAC
CGGGCACTGG ACATCACTGA GCCGATGATC CATGTGGACA ACGGGGAAGC CCCAACGGAC
CCGGCACAGA CCGCTGGCAC ATCCGGCACC GGCACACCCG GCACCACGGC GCATGACGCC
GGCACATCCG GCGTCATCAC AACGGGCGCC ACGCCGCCCG ATGCCAGCAC GTCCGGCACC
GGTACACCCG GCACCACTGC GCCCGACGCA GGCACACCAG GCGTCGTCAC AACGGGCGTC
ACGCTGCACG ACTCCGGCAC ACCCGGCACC GGCACACCCG GCACCGGCAC ATCCGGCACC
GGCACACTTG GCACCGGCAC ATCCGGCACC GGCACATCCG GCACCGGCAC ACTTGGCACC
GGCACATCCG GCACCGGCAC ATCCGGCACC GGCACACCTG GCACCGGCAC ACCTGGCACT
GGCACACCTG GCACCGGCAC ACCTGGCACC GGCACGCCCG GCACCAGCAC ACCTGGCACC
GGCACATCCG GCACCACTGC GCCCGACACC GGCACACCCG GCGTCATCAC AACGGACACC
ACGACGCACG ACGCCGGCAC GCCCGGCACC GGCACCGGCA CACCCGGCAC TACTGCGCCC
GACGCCAGCG TCCCCGGCGT CATCACAACG GGCGCCACTG CGCCCGACGC CGGCACGCCC
GGAGCCAGCG CATCAGGCGC CACGCCGCAC GACACCGGCA CGCCCGGTGC CGGCACCACG
ACGCCCGGCA CCGGCACACC CGGCACTACT GCGCCCGACG CCAGCGCCCC GTCCCCGTCG
GAGAACAACG CCACCCCGTC CCCACCGTCG GACAGCACTC CATCCTCCGG CAGCCCATCG
ACAGCCGACG TGCTGGACAT CGTCAGCCAG TTCAGCATGG TCGGAAACCA AGCCAATGAC
CAGACCCCTG CGACCCCCGT GGCCCCCGTG GCCGGCGACC GCAACGCAGA CGGCGTTCTG
GACCGCACGC AGCCTGCGGT GCACTCGATC AGCGTCACGC CCAACGCCAA CGCCCCCGGC
AGCCTGGCAG ACGCGCCATC CGGCCTGGTC ACGCTGGTCA ACGACAGCCA GGACGGCAAG
GTGCGCCCCG GCAGCCAGGA ACGCATCACC AGCCTGGAGC AGAAAGACGC GCCCGCCCAA
CTGCCGAAGG GAATGGAGAT GCCCATCGGC CTGCTGCACG CGCAAGTGAC GCAGGCCGTG
GACAGCGGCC ATCCGGCAAG CCTGAGCCTG TTCGTCGCCC TGGCGCTGAA CGTCAACGAA
CTGTGGGTGC AGGACAACGG CACCGGCGTC TGGACGAACC TGGCCAGCGC ACCCTACGGC
GGCAAGACGG TGCTGGCAGA CGGCCAGCTC AGGCTGGACA TCCATATCGA TGACGGCGGG
CCGTTCGATG CGGACGGCAA GGTCGACGGC GTGGTCAGCG TCGTGGGGGC AGCGGCGCAT
ATGCCGCTGT CCATCGTCGG ACAGGCGCCC GACGTGGCGC AGCACGGGTT CTGGTTCTGA
 
Protein sequence
MTIDVVDGGQ DAGTSGASTQ GNRAVSFAPT MTIDVDGGLD QTAPVLSTGA DAPKVSGDKL 
VLRYSEANLL GTAHPDPAAH AFAVTVNGRT NPVTDVAVDS QTKTVTLTLT RPVTGGEDVS
VTYTKPTDSS KAIQDAAGNI ADDITAPLRV NSGTDTTAPT LVADTVAPPR IGGGGKELVL
TYEDANYLKA VQVPKTAFAV LVNGTPNDVT AVKVDGGQDK TVTLMLSRSV PSGAEVSIAY
TPPGESTQAI QDRQGNPAAG IVEPMHVDSG RDQTAPMLVF GGISTVSGDK LVLGYSEANL
LGTAHPDPAE AARAFAVTVN GRTNPVTGVA VDSQDKTVTL TLTRPVTGGE DVRVAYTKPA
DSSKAIQDAA GNFAKDITAP FPVDSGRDTT APTLVTDTVA PPRIGGGKEL VLTYEEANYL
KAVQVPKTAF AVLVNGTPND VTAVKVDDGQ DKTVTLTLSR SVPSGADVSI AYTPPGESTE
AIQDWQGNPA AGIREPMHVD SGRDQTAPVL VFGGISTVSG DKLVLSYDED NNLDTVHKAL
AGSFTVLVDG SPNPVTEVSV ANAPAKTVTL TLTKAVARGV PVTVAYQATA DNAIQDMAGN
RALDITEPMI HVDNGEAPTD PAQTAGTSGT GTPGTTAHDA GTSGVITTGA TPPDASTSGT
GTPGTTAPDA GTPGVVTTGV TLHDSGTPGT GTPGTGTSGT GTLGTGTSGT GTSGTGTLGT
GTSGTGTSGT GTPGTGTPGT GTPGTGTPGT GTPGTSTPGT GTSGTTAPDT GTPGVITTDT
TTHDAGTPGT GTGTPGTTAP DASVPGVITT GATAPDAGTP GASASGATPH DTGTPGAGTT
TPGTGTPGTT APDASAPSPS ENNATPSPPS DSTPSSGSPS TADVLDIVSQ FSMVGNQAND
QTPATPVAPV AGDRNADGVL DRTQPAVHSI SVTPNANAPG SLADAPSGLV TLVNDSQDGK
VRPGSQERIT SLEQKDAPAQ LPKGMEMPIG LLHAQVTQAV DSGHPASLSL FVALALNVNE
LWVQDNGTGV WTNLASAPYG GKTVLADGQL RLDIHIDDGG PFDADGKVDG VVSVVGAAAH
MPLSIVGQAP DVAQHGFWF
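
Both sequences are displayed in blocks of 10 characters, 60 per line, so the spacing has to be removed before any analysis. The sketch below is one possible way to do that and to translate the gene sequence into protein. It assumes the gene sequence is given on the coding strand, starts with ATG, and contains only the letters A, C, G and T (any other symbol raises a KeyError). The amino-acid assignments of translation table 11 are the same as those of the standard code; table 11 differs in which start codons it allows, which does not matter for an ATG start. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for first, second and third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove display spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    cds = clean(cds)
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGACAATCG ATGTGGTGGA CGGCGGACAG GACGCCGGCA CGTCCGGCGC CAGCACGCAG"
print(translate(first_line))  # prints: MTIDVVDGGQDAGTSGASTQ
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.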