Gene TM1040_1407 details

Gene Information

Locus tag: TM1040_1407
Symbol:
ID: 4078037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1499580
End bp: 1501970
Gene Length: 2391 bp
Protein Length: 796 aa
Translation table: 11
GC content: 59%
IMG OID: 638006717
Product: surface antigen (D15)
Protein accession: YP_613402
Protein GI: 99081248
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4775] Outer membrane protein/protective antigen OMA87
TIGRFAM ID: [TIGR03303] outer membrane protein assembly complex, YaeT protein
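
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information listing.
start_bp = 1499580
end_bp = 1501970

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in triplets; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2391 bp) and Protein Length (796 aa).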


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.308963
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTGGG GGAAAAACGC GGCATGTGCC CGCGATGCGC GCCGAGAGGG CGCAATGTCC 
GTTTCTTTTA AGGGCTTTCT GGGAACCACC GCGCTTTCTG CGGTGCTGGC GATCGGGCTT
GGGATCGCGC CGCTGCCGGC GCAGTCTCAG GAATTCCGGT TCACCAATGT ACGCGTCGAG
GGAAACCAGC GCATTCAAAG CTCGACCATC GTGGCCTATA CGGGCCTCTC GCGAGGAGAG
CGGGTGAGCG GTGGTGAGCT CAACGATGCC TACCGCGGCG TGTTTGACAG CGGTTTGTTC
GAGTCTGTCG AACTGGTGCC CCGTGGCAAT ACGCTCGTCA TCAAAGTGGT GGAATTTCCC
ACCATCAGCC GGATCAGTTT TGAGGGCAAC AAACGTCTGA AGGACGACGC ACTTGGCGAG
GTGATTGAAT CCTCGCCGCG CCGTGTGTTC AGCGCCGATC AGGCAGAACG CGATGCGGGC
GCAATTGCTG AACTCTATCG CGCGCAGGGC CGTCTGGCCT CTCGGGTGAC GCCGCGCATC
ATCCGCCGCA GCGACAATCG CGTCGATCTC ATCTTTGAGA TCTCCGAAGG CGATACCGTC
GAAGTCGAGC GCGTCTCCTT TGTCGGCAAC CGCGCGTTCT CGGATCGGCG TTTGCGGCGC
ATCCTTGAGA CCAAACAGGC CACATTCCTG CGGGCTCTGA TCAACCGGGA TACGCTGATC
GAGGATCGTA TCCAGTTCGA CCGTCAGGTG TTGACTGATT TCTACCGCTC GCGCGGCTAT
GTCGATTTCC GGGTCAACAG CGTCAATGCC GAAGTTACAA AGGAGCGCGA TGCGGCTTTC
CTCGTGGTGG ATGTGACCGA AGGCCAGCAG TTTGAGTTTG GCGAGATCAC CGTCACCAGC
GAGTTTGCCG ATGCAGATCC GGATCTCTAC CGCAGTAAAT TGCGCCTGAA GCCGGGCGTG
ACCTACTCGC CGCTGTTGAT CGAAAATGCG ATTGAGACTC TCGAGCAACT CGCGGTGCGT
CAGGGGATTG ATTTCCTGCG GGTGGAGCCG CGTGTCACGC GCAATGATCG GGATCTGACA
CTCGATGTTG AATTTGCAAT GGTCCGCGGA CCGCGGGTCT TTGTCGAGCG CATCGACATT
GAAGGCAACA CCACCACTCT CGACCGGGTG ATCCGTCGAC AGTTCCGCAT CGTTGAGGGC
GACCCTCTCA ATCAGCGCGA AATCCGCAAC AGCGCCGAAC GTATCAAGGC TCTGGGCTTC
TTCTCGGAAT CCGAGGTGGA TGTGCGTCAG GGCAGCACGC CAAGCGAGGT CATTGTTGAT
GTCGATGTGG TCGAGCAACC CACCGGTTCG TTCAACCTTG GTGGTTCCTA CTCGGTGGAC
GACGGGATCG GGGTGGCAAT CGGGATTTCC GAAAAAAACT TCCTTGGACG CGGTCAGCAA
CTGTCGTTCA ATATTTCGAC CACAGAAGAC ACCGAAGCCT ACAATCTGCG CTTTGGCGAG
CCCAAGCTTC TCGGCCGTGA TCTGCAGTTC GCCATCGACC TCGGTCTGAG CGAAACAGAG
TCGAGCTACT CCGAGTATGA CACCAAAGAG GTGGTCTTCC GCCCATCCAT CACCTTTGAT
GCCAGCGAAA CCACATCGTT GCAGTTGCGG TATGGGTTTG AGCAGGACGA GATGGAAAGC
CGTGGTCTCG ACAGCAACAA TCAACCTCTG GTTGGTCCGG TGCTTCAGCG CGAGATCGCA
GAAGGCAAGC GCACGACAAG CTCCATCGGG GCGACCTTCA CCTATGACAG CCGTCGCACC
GGTCTCAACC CAACGGCCGG GTTCCTTGTC CAGAGCGGCA TCGATTATGC GGGTCTTGGT
GGGGACAATG AATACATCCG TGCCACCAGC AAGATCGTGG CGCAGAAGCT TGTGTTCAAC
GAAGAGGTCA CTCTGCGCGC CACCGTGGAA GCAGGCTATC TTGGCTGGCT CGGTGACAAT
CGCAGCCGCA CCATCGATCG CTTTCTTCTG ACTTCGGACA CTCTGCGTGG CTTTGAGCCT
GGCGGGATTG GTCCGCGTGA TATGAGCGGC TCTTACGACG ACGCGCTCGG CGGCAACCTT
TATGCGGTCG CGCGTTTTGA TGCGGAGTTC CCTCTGGGCC TGCCGGAAGA GCTCGGCCTG
CGTGGCGGTC TCTTTTATGA CGTCGGCAAC CTCTGGGGGC TGGAAGATTC CAACGCTGAT
TTCGGCAGCA ATGTCGTCGG ACGCGACGGC TCCTTCCGCC ATGTCGTCGG CTTTTCGCTT
CTCTGGACCA CGGGCCTTGG ACCATTGCGG TTCAACTTCT CCAAGGCACT GGTCAAGGAA
GACTTCGACA AGGAGCGGAA CTTCGACCTG ACCATTCAGG CGCGGTTCTA A
 
Protein sequence
MVWGKNAACA RDARREGAMS VSFKGFLGTT ALSAVLAIGL GIAPLPAQSQ EFRFTNVRVE 
GNQRIQSSTI VAYTGLSRGE RVSGGELNDA YRGVFDSGLF ESVELVPRGN TLVIKVVEFP
TISRISFEGN KRLKDDALGE VIESSPRRVF SADQAERDAG AIAELYRAQG RLASRVTPRI
IRRSDNRVDL IFEISEGDTV EVERVSFVGN RAFSDRRLRR ILETKQATFL RALINRDTLI
EDRIQFDRQV LTDFYRSRGY VDFRVNSVNA EVTKERDAAF LVVDVTEGQQ FEFGEITVTS
EFADADPDLY RSKLRLKPGV TYSPLLIENA IETLEQLAVR QGIDFLRVEP RVTRNDRDLT
LDVEFAMVRG PRVFVERIDI EGNTTTLDRV IRRQFRIVEG DPLNQREIRN SAERIKALGF
FSESEVDVRQ GSTPSEVIVD VDVVEQPTGS FNLGGSYSVD DGIGVAIGIS EKNFLGRGQQ
LSFNISTTED TEAYNLRFGE PKLLGRDLQF AIDLGLSETE SSYSEYDTKE VVFRPSITFD
ASETTSLQLR YGFEQDEMES RGLDSNNQPL VGPVLQREIA EGKRTTSSIG ATFTYDSRRT
GLNPTAGFLV QSGIDYAGLG GDNEYIRATS KIVAQKLVFN EEVTLRATVE AGYLGWLGDN
RSRTIDRFLL TSDTLRGFEP GGIGPRDMSG SYDDALGGNL YAVARFDAEF PLGLPEELGL
RGGLFYDVGN LWGLEDSNAD FGSNVVGRDG SFRHVVGFSL LWTTGLGPLR FNFSKALVKE
DFDKERNFDL TIQARF
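
The sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation checked against the protein sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments; table 11 differs in its permitted alternative start codons, which this sketch does not handle (this gene begins with ATG, so it is not affected).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGTCTGGG GGAAAAACGC GGCATGTGCC CGCGATGCGC GCCGAGAGGG CGCAATGTCC"
fragment = clean_sequence(first_line)

print(len(fragment), "bases")
print(f"GC content of fragment: {gc_percent(fragment):.1f}%")
print("Translation:", translate(fragment))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` allows the same check to be run against the complete protein sequence and the listed GC content.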