Gene TM1040_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1142 
Symbol 
ID4078438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1227757 
End bp1229316 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content66% 
IMG OID638006446 
Producthypothetical protein 
Protein accessionYP_613137 
Protein GI99080983 
COG category[S] Function unknown 
COG ID[COG2861] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0823689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGCG CGCGTGAGAA AGAGCACAGG CGAGAAGGCG AAACAGGCAT GCGAGGATTT 
CTGGGTGGCG TGAGCGTAGG GGCGCTGGTC GCAGTTGCGG GGGCGGCGGT GTGGTCGCTG
TCGACGCCAT TGCCGCAGGC GGTCGATGTC TCGCAGGAAC TGCCCGTCAC AACATTGCAG
CCGCGTGAGG TTCCAGCGCC CCCGTCCGAT GTGCCCGGAA GCGACGCAGA TCTGGTTGAG
GCCGCCCCAG CGGAACCCGA TGCAAATGCC CTTGGCCGAG ACAGCGCCCC AGATGAGATC
GACACTTCGC TGCCAGAGCG TCCCAGCGTT TCGACCGATC CCGAGGTCAC ATTGGATGGC
GCCCCGGTTC AGAGCGACAC CCCCCGGATT GCGGTGACTT CCGATGCGGA TGCGCCCGTG
TCTGGTGGCG ATGACAGCAT CCGCCCTGAG GCGCCCTCGC AAGACACCAC ACCGGATCTG
GGAGCAGATT CCGCCACACG GCCCGAGGTT TCGGCGGCCT CAGACCTGCC ACTGCAGGCG
CCAGACGCGG AAATCCCCAC CCCCGATCTG TCGACCGAGG CCGATCCGGC GCCCCTGCGC
CAGGAGACGC TTCAAGTGGA GGCGCCAGAT ATCGGCACGC CTCCGGATGT GACCGCGTCG
CCCGTGTTGA CCACTCCGCC CGTGCGGTCG CTCACCCCTT CTGATCCGGT CGAGGACAAC
GCGGGGGCAA ATGGGGGCGT GAGAATCGCG GATCTGCCGC AGGCCTCCGA GGACCCAGAG
ACAAATGCAG GACCCAGTAT CGGCACGCGC GTGCTGCCAC TGACAGAACG CGACACCACG
GGCGCGGATG CGTCTTCTGC AGCAGACAGT AGTGCTGCGC CCTTTGTGCG CAACAGCGAG
CCGGCAAACC CGGAGCCGGG CCTGCCCTTG ATGTCGATTG TCCTCGTCGA AGAAGAAGGC
GCGGTGGGCG CCGAGGCGCT TGAGGATTTC CCCTATCCGC TGACGTTTAC CATCGACCCG
AGCGACCCGG ATGCGGTGGC ACGCATGAAA GCCCGGCGCG CGGCGGGGTT TGAGGTCATG
GTGTTGGCGG ATCTGCCCCG CGAAGGCCAG CCGCAGGACG CCGAAACCGC GATGCCGGTG
TGGTTTGACC GCCTTCCAGA GGCGGTGGGC ATCCTTGAGG GCATCGACAG CGGCGTGCAG
GGCAACCGGG CGCTTGCGGA TCAGGTGGCC AGCATCGCCG GTGATCTGGG CTATGGGCTG
GTGCTACAGG ACAATGGCCT GAACACGGTC CACAAAATGG CGCTGCGCGA TGGTATTCCT
TCGGGCGTGG TGTTTCGCGA CTTTGACGGC GCGGGTCAGG ACCCGCGCGC CATGCGCCGT
TTTCTGGACC AGGCCGCGTT CCGCTCCGGT CAGGAGGGCG CGGTCATCAT GCTGGGACGT
CTGAAGCCGG ACACGATTTC CGCGCTGCTG ATCTGGGGGC TGCAAGACCG CGCCAGCAGC
GTGGCGCTGG TGCCGATCTC GACCAGTCTC AAACGCCTGC TGGAGCCGGT CTCAAACTAA
 
Protein sequence
MGGAREKEHR REGETGMRGF LGGVSVGALV AVAGAAVWSL STPLPQAVDV SQELPVTTLQ 
PREVPAPPSD VPGSDADLVE AAPAEPDANA LGRDSAPDEI DTSLPERPSV STDPEVTLDG
APVQSDTPRI AVTSDADAPV SGGDDSIRPE APSQDTTPDL GADSATRPEV SAASDLPLQA
PDAEIPTPDL STEADPAPLR QETLQVEAPD IGTPPDVTAS PVLTTPPVRS LTPSDPVEDN
AGANGGVRIA DLPQASEDPE TNAGPSIGTR VLPLTERDTT GADASSAADS SAAPFVRNSE
PANPEPGLPL MSIVLVEEEG AVGAEALEDF PYPLTFTIDP SDPDAVARMK ARRAAGFEVM
VLADLPREGQ PQDAETAMPV WFDRLPEAVG ILEGIDSGVQ GNRALADQVA SIAGDLGYGL
VLQDNGLNTV HKMALRDGIP SGVVFRDFDG AGQDPRAMRR FLDQAAFRSG QEGAVIMLGR
LKPDTISALL IWGLQDRASS VALVPISTSL KRLLEPVSN