Gene Tcr_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_0449 
Symbol 
ID3761476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp502351 
End bp503727 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content43% 
IMG OID637785160 
Productaromatic hydrocarbon degradation protein 
Protein accessionYP_390719 
Protein GI78484794 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CAAGAATCGC TTTAGCAATT GCAACTGCTG CAATGGTTTC AACGCCTGTT 
CTAGCAACGA ACGGTACAAA CATGATTGGT TTAGGCGCAC AATCAAATGC AATGGGTGGT
ACAGGGGTAG CGGCTAGCTA CGGTGCAGAA ACAGTTATTG CTAACCCAGC CATGATCGGT
AAAACCACAG GAACTGAAAT GACTTTCGGC GGCACTTTGT TCCAACCATC CGTTGAAACA
ACAAACAATA TCAAAAACAG CGCTGTTGCT GCAGCTAATG GCATTACGTC TGCACAGGCT
GCTGCTGCCG GAGCACCGAT TGGAAGTGCA ACCAGTGATG CAGATACCAA TGTTATTCCA
GCAGTTTCTT TATCAAGCCG AATTAATGAC AAATTAACTT TCGGTATTGG TATGTTTGGT
ACATCAGGTA TGGGGGTTGA TTACCAAGAT GAAGACATGT TGTTTAATGC ACAAACAAAT
ATGCAAATCA TGAAGTTCGC TCCAACATTA GCGTTTAACT CATCAAAATT TGGGATTGGT
GTTTCGCCAA TTTTACAGTA TGGCTCATTA GACATTAACT ACCAAACTCA AATGACAAAT
AATTCTGGTG CACCAATGTA TGTTCAAGGC GGTGGAAACA TCAACTCAAC TCCATCGGCA
ACCCCTGCCA TGAAAACCGT AGGTCACGGC ATGGCTTCTG ACCTTGGAAT GGGATATAAC
ATTGGTGGTT ACTTTAATAT CACTAATGAC TTAACCGTAG CAGCGTCTTA CTTGTCGCCT
ATCAACATGA AATATAAAGG TCAACTATCT ACTGCATCAG GTGCGTTTGT TAACCCAGCC
TCTGACTTTA CAACACCGTT CAGCGACAAT CTTGAACAGC CTTCTGAAAT CAAGGTCGGT
GCGGAATACA TCATGGGAAG ATTCTCTGTC AACGCCGATT ATAAAAAAGT GGCTTGGGGA
TCTGCTAAAG GATATAAAGA CTTTGGCTGG GAGGACCAGG ACGTTTACTC TTTAGGTGCT
AAATTTGCTA CAAACAACTA TTGGTTAGGG GCAGGTTATA ACTACGGAAG CAACCCAATT
AAAGAACAAG ATGGAACAAG CTATAAAGGC GCTGTTATCA ACATGTTTAA CAACACGTTC
TTCCCTGCAA TCACAGAAAG TCACTTTACC TTTGGTGGTG GCTACTCTAT TACCAAGCAT
ACGACCTTAG AAGGGTCTGT TGTATATGCG CCAGAAGTTG AAAACTCTGT TGACACGACA
GCTGTCTCTT CAGCATTTGC ATCAGGAATG GCAGGCACAC CAACGGCAGC TGCTAGTACC
AGTACGACAA AGCATTCACA GATTGGCTAT ACAGTGCAGG TAAAATATAA TTTTTAA
 
Protein sequence
MKKTRIALAI ATAAMVSTPV LATNGTNMIG LGAQSNAMGG TGVAASYGAE TVIANPAMIG 
KTTGTEMTFG GTLFQPSVET TNNIKNSAVA AANGITSAQA AAAGAPIGSA TSDADTNVIP
AVSLSSRIND KLTFGIGMFG TSGMGVDYQD EDMLFNAQTN MQIMKFAPTL AFNSSKFGIG
VSPILQYGSL DINYQTQMTN NSGAPMYVQG GGNINSTPSA TPAMKTVGHG MASDLGMGYN
IGGYFNITND LTVAASYLSP INMKYKGQLS TASGAFVNPA SDFTTPFSDN LEQPSEIKVG
AEYIMGRFSV NADYKKVAWG SAKGYKDFGW EDQDVYSLGA KFATNNYWLG AGYNYGSNPI
KEQDGTSYKG AVINMFNNTF FPAITESHFT FGGGYSITKH TTLEGSVVYA PEVENSVDTT
AVSSAFASGM AGTPTAAAST STTKHSQIGY TVQVKYNF