Gene TM1040_1049 details


Gene Information

Locus tag: TM1040_1049
Symbol:
ID: 4078107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1127786
End bp: 1129018
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 62%
IMG OID: 638006353
Product: cytochrome B561
Protein accession: YP_613044
Protein GI: 99080890
COG category: [C] Energy production and conversion; [S] Function unknown
COG ID: [COG2353] Uncharacterized conserved protein; [COG3038] Cytochrome B561
TIGRFAM ID:
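
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1127786
END_BP = 1129018


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1233 410
```

Under these assumptions the results match the reported Gene Length (1233 bp) and Protein Length (410 aa).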


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.183438
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.102385
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCGCC AAAACACCCC GCAGAGCTAC GGCTCCATCA CCAAGAGCTT TCACTGGCTC 
ACCGCCCTCC TGATCCTGAC CGCCTTTCCG CTTGGATATT TTGCGACCGA ATTGGCAGAG
CATATCCAGA GCAGCGCGTT TGACGGCTCC CAAGCGACGA TTGATCGCGC TACGCTCTTG
TTTTCGCTTC ATAAAACCAT TGGCGTTGCG GTGTTCTTTA CCGCGCTCCT CCGCATCCTC
TGGGCCATCA CCCAGGAAAA GCCGGGCCTG TTGCACCCGG ATCGCAAGCT GGAGGCCTGG
GCGGCAGAGA CGGCCCATTG GGTGCTCTAT GGTGCGATGG TGATTGTGCC GCTTTCGGGC
TGGATTCATC ACGCCGCGAC CGATGGGTTT GCGCCCATCT GGTGGCCCTT TGGGCAAAAC
CTGCCGCTGG TGCCCAAATC CGAGTTTGTC TCCAAACTGT CTTCTTCGGT GCATTTCTAC
GCGATGCTGT TGCTTGGCGC GTCGATCCTG GCGCATGTGG GTGGCGCGCT CAAACATCAT
GTGATCGATA AAGACAGCAC GTTGGTGCGC ATGCTGCCGG GCCGTCGCGC TCTGCCCGAG
CCGCCCGCAC AACACCATTC TGCCTTGCCG CTGCTGACAG CGCTCGTGGT CTGGGGCGCG
GTGATCGGTG GCAGCACAAT GCTCTTTCTA AACACCCAGA GCGCCAAAGG CACCGTGGCA
CCGGTGGCAG CTGCACCGGT TGAAGGCTCG GGCTGGACTG TCGAAAACGG AACGCTGGCG
ATCGAAGTGG TCCAGATGGG CAGCGCCATC ACGGGTACAT TCTCCGACTG GCGCGCCAAG
ATCGACTTTG AAGAACCCGC CAGCCCCGGC CCGGCCGGTC GTGTCGAGGT CGCCATTGCC
ATCCCATCGC TCACGCTCGG GTCTGTGACC GATCAGGCCA TGGGGTCGGA CTACTTTGAC
GCCGAGACCT ATCCGCAGGC GACGTTTGAG GCCGAGATCA TCCAGATCGA GGGCGCGCAG
TACGAGGCAA AAGGCACGCT CACCATTCGC GATCAGACGG TGGCGACCAC CCTGCCCTTC
ACGCTCGATC TTGACGGGGA TACGGCCACC ATGAGCGGGC GAACGGAGGT TAACCGTCTC
GATTTCAACA TCGGGACCGG CACGCAAGAC GAGGGCACCC TGGCCTTTGG CGTCGACATC
ACGGTGGATC TGGTCGCGAC ACGCGCGCCC TGA
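
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any length or GC calculation. The following is a minimal sketch of that cleanup; the helper names are hypothetical, and only the last line of the sequence is used as sample input. Pasting the full sequence in its place would allow comparison with the reported 1233 bp and 62% GC.

```python
# Stop codons shared by the standard code and translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Sample input: the final line of the gene sequence above, not the whole gene.
sample = clean_sequence("ACGGTGGATC TGGTCGCGAC ACGCGCGCCC TGA")
print(len(sample), ends_with_stop(sample), round(gc_fraction(sample), 3))
```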
 
Protein sequence
MSRQNTPQSY GSITKSFHWL TALLILTAFP LGYFATELAE HIQSSAFDGS QATIDRATLL 
FSLHKTIGVA VFFTALLRIL WAITQEKPGL LHPDRKLEAW AAETAHWVLY GAMVIVPLSG
WIHHAATDGF APIWWPFGQN LPLVPKSEFV SKLSSSVHFY AMLLLGASIL AHVGGALKHH
VIDKDSTLVR MLPGRRALPE PPAQHHSALP LLTALVVWGA VIGGSTMLFL NTQSAKGTVA
PVAAAPVEGS GWTVENGTLA IEVVQMGSAI TGTFSDWRAK IDFEEPASPG PAGRVEVAIA
IPSLTLGSVT DQAMGSDYFD AETYPQATFE AEIIQIEGAQ YEAKGTLTIR DQTVATTLPF
TLDLDGDTAT MSGRTEVNRL DFNIGTGTQD EGTLAFGVDI TVDLVATRAP