Gene TM1040_2627 details


Gene Information

Locus tag: TM1040_2627
Symbol:
ID: 4077930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2759858
End bp: 2761072
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 65%
IMG OID: 638007951
Product: protein of unknown function UPF0052 and CofD
Protein accession: YP_614621
Protein GI: 99082467
COG category: [S] Function unknown
COG ID: [COG0391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01826] conserved hypothetical protein, cofD-related
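
The coordinate and length fields above are consistent with each other. The short Python sketch below checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2759858
end_bp = 2761072

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1215 404
```

Both results match the Gene Length (1215 bp) and Protein Length (404 aa) fields.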


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.800354
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAAGA CCGCGCCCTC GCCCCCAGCC CCAGCCTCTG CCCCCGGCTC CGATCCCGGT 
TCTGATCCCG GCCCTGCCCA CGTCCGGCCC CCCGGACGGG ATGCAACACA AGGCCCGCGC
CTGTTGTTCT TTTCCGGTGG CACGGCCCTC AACGAGATCT CGCGCAGGCT CAAGGCCTAC
ACGCAGAACT CGGTGCATCT GATTACACCC TTCGACAGCG GTGGCTCATC GCAGGTGCTG
CGCAAAGCCT TTGGCATGCC CGCGGTCGGC GACCTGCGCA GTCGACTTAT GGCGCTGGCG
GATGAAACCG ATAGAGGCCA GCCAGAGATC CTGCGCCTGT TCACCCATCG CTTTGCAAAA
CACGCGTCAG AGCGGAATGT GACACAGGAC GCTGCCCGGC TCTTTGAGGG CACGCATCCG
CTTTTGCAGG GGATTCCAAC GCCGGTACGA CAGCAAATCC GCGAAGACCT GCGCCAGTTT
CAGGACGCCG CCCCGGCGGA TCTGGACTAT CGCAACGCCA GCATCGGCAA CCTGATCCTC
GCGGGGGGCT ATCTGCGTCA CGGCCGCCAG CTTGAGCCGG TGCTTGCCCA GATGTCGCGG
ATGGTGGCGG TGCGCGGCAC CGTGCGCCCG ATTGCGGATG TGAACCTGGA GATCGGTGCA
GAGCTTCGGG ACGGGCGGCG CGTCATCGGT CAGCGCCGGA TGACGGGCAA GGAGCACGCA
CCGCTCACCA GCCCTATCGC GCGCCTCTTT CTGTCAGATG GCACCCGCGA ACTGCCTGCG
GATGCGGTGC CCCTCCCGCA AAGCAACCAA GACCTCATCG CCGGGGCGGA CCTGATCTGC
TACCCGCCCG GCAGTCTCTA TTCGAGCGTG ATCTGCAATC TCCTGCCCAA AGGTGTGGGC
CAGGCCATTG CCGCGCGCAA CGTGCCAAAG GTCTATGTCC CAAGCCTCGG CACAGATCCG
GAATGTCTGG CGATGACGCT CTCGGATCAG ATCTCTGCCT TACTGGCACC GCTGCGCCGG
GACGCTGGTG ATGTGGCCAC CTCAGCCTTT CTCAGCCATG TGATCTGTGA CCTCAGCGTG
TCAGAGGCGG CACGCGCAGA GGTCCTGCGC GATCACGGCA TCCCCTGCAT CGCGCGGCCT
TTGGCGGTCT CTTCGGGCAA GTCGCCCTGC TATGCGCCGG ATGCCCTCTG CAGACAGCTA
CTGGCACTGG CCTGA
 
Protein sequence
MSKTAPSPPA PASAPGSDPG SDPGPAHVRP PGRDATQGPR LLFFSGGTAL NEISRRLKAY 
TQNSVHLITP FDSGGSSQVL RKAFGMPAVG DLRSRLMALA DETDRGQPEI LRLFTHRFAK
HASERNVTQD AARLFEGTHP LLQGIPTPVR QQIREDLRQF QDAAPADLDY RNASIGNLIL
AGGYLRHGRQ LEPVLAQMSR MVAVRGTVRP IADVNLEIGA ELRDGRRVIG QRRMTGKEHA
PLTSPIARLF LSDGTRELPA DAVPLPQSNQ DLIAGADLIC YPPGSLYSSV ICNLLPKGVG
QAIAARNVPK VYVPSLGTDP ECLAMTLSDQ ISALLAPLRR DAGDVATSAF LSHVICDLSV
SEAARAEVLR DHGIPCIARP LAVSSGKSPC YAPDALCRQL LALA