Gene TM1040_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1668 
Symbol 
ID4075771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1767772 
End bp1768983 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content63% 
IMG OID638006981 
Productphage major capsid protein, HK97 
Protein accessionYP_613663 
Protein GI99081509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAG AAAACACCAG GTCGGTGGCG GAACTCGCTG CCGAAATCAA AGCCGACCAC 
GCCAAGAGCG TTGATGCCGT CAAGGCCATT GCCGAGGAAG CCCTGGGCAA AGCTCAGAGC
GGCGAAAAGT TGGCGAACGA CCTGAAGGAA AAGGCCGACG AGGCCCTGAC CGAAATGAAC
GGCTTCAAAG CGTCTCTGGA CGCGCTGGAG CAAAAGCTGG CGCGCGGCGC GGGCGGCGAA
GGTAGCGACG GCGAAAAGTC GCTGGGACAG CGCTTCGTTG AGAGCGAGGG CTTCAAGTCG
TTCAAGGACG GCGGCTTTGA CCGTCACAGC AAGGCGAAGC TGGAAACCAA GGCGACCCTG
ACCCTGGCGA CCACTGACAC AGATGGCGCC GTTGGCGATG GTGTAGCCCC GACCCGCCTG
CCGGGCATCC AGGGCTTGCC GCAGCGCCGC CTGACCATCC GCGATCTGCT GGCGCAGGGC
CGCATGGATG GCAACACCAT CGAGTACGTG CAGGAGACCG GTTTCAACAA CAACGCGGCT
CCGGTGGCCG AAGGCGCTGC AAAGCCGTCC TCAGATATCA AGCTGGACGT GAAAACCACC
ACTGCCAAGG TGATCGCGCA CTGGATGAAG GCATCGCGCC AGGCGCTGGA TGATGTTTCC
GCCCTGCGCT CGATGATCGA CCAGCGCCTG CTGTTCGGCC TGGCGCTGGC GGAAGAAAAC
CAGCTTTTGA ACGGTGACGG CACCGGCCAG AACCTGTCCG GCCTGATCAC CAACGCCACA
GCCTATTCGG CGGCGTTTGC GCCGGCATCC GAGACCGCAA TCGACAAGAT GCGCCTCGCC
ATGCTGCAAG CGGCTCTGGC TGAATACCCG GCAACGGGAC ACGTGATGCA CCCGACCGAC
TGGGCACGGA TCGAGCTGAC CAAGGACGGC AACGCCAACT ACATCATCGG CAAGCCGCAA
GGCACCATCG CGCCGACCCT CTGGGGCCTG CCGGTTGTGG CTACACAGGC GATCACCGTG
GACAAGTTCC TGACCGGTGC GTTCAACATG GGTGCTCAGA TCTTCGACCG CTGGGATGCG
ACGGTCGAAA CCGGCTACGA GAATGACGAC TTCACCAAGA ACCTCGTCAC CATCCTGGCC
GAGGAGCGTC TGGCGCTGGC GATATTCCGC CCCGAAGCGT TTATCTACGG CGATCTGGGC
TACGTGGCCT AA
 
Protein sequence
MAEENTRSVA ELAAEIKADH AKSVDAVKAI AEEALGKAQS GEKLANDLKE KADEALTEMN 
GFKASLDALE QKLARGAGGE GSDGEKSLGQ RFVESEGFKS FKDGGFDRHS KAKLETKATL
TLATTDTDGA VGDGVAPTRL PGIQGLPQRR LTIRDLLAQG RMDGNTIEYV QETGFNNNAA
PVAEGAAKPS SDIKLDVKTT TAKVIAHWMK ASRQALDDVS ALRSMIDQRL LFGLALAEEN
QLLNGDGTGQ NLSGLITNAT AYSAAFAPAS ETAIDKMRLA MLQAALAEYP ATGHVMHPTD
WARIELTKDG NANYIIGKPQ GTIAPTLWGL PVVATQAITV DKFLTGAFNM GAQIFDRWDA
TVETGYENDD FTKNLVTILA EERLALAIFR PEAFIYGDLG YVA