Gene Gura_3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3101 
Symbol 
ID5164167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp3666680 
End bp3667816 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content44% 
IMG OID640550589 
Productcysteine desulfurase family protein 
Protein accessionYP_001231839 
Protein GI148265133 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01977] cysteine desulfurase family protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTATA TCAATAATGC TTCGACTTCT TCACCAAAAC CGGAAGCGGT TTACAAAGCC 
GTTGAACTGT GTATTCGGAC AAGTGGCATG AGTTCGGATA GAAGCTCATT TGTCTCCAAG
CTGGATTTTA TCCCTAACGA AACGAGGGCA TTGATAGCAA AGCTGATCAA TGCCAGTGAT
CCTACCCAAA TCGTATTCAC TATGAATGGT ACTGAGGCCT TGAATTTGGC TATCAAAGGT
ATCTTGAAGC GTGGGGACCA TGTTATCACC ACGAGCCTGG AACATAATTC CGTTATAAGG
CCTCTTAAGC ACTTGGAACA AGATGGGGAT ATTGAACTCA GCATTGTCCA GGCCAGTTCC
GAAGGTTTAT TGGATCCCAA TGACATTGTC CCGTTGATCA AAAGTAACAC TAAACTGATA
GTAACTGCCC ACATAACCAA TGTATTGGGA ACGACGATAC CTATCGAAGA AATCGGCAAA
ATTGCAGCTC AGCATAACAT AAAATACTTG GTGGACGCGG CTCAAAGTAT AGGTTTTGCC
GATATCGACG TCGAGAAAAT GAATATAGAT ATGCTCGCAT TCCCCGGCCA TAAGTCCTTA
TTTGGGCCTT CAGGCACTGG AGGATTGTAT ATAAAGAAAG GCATAGATCT CACACCGATT
AAGTATGGTG GCACCGGCAA TTTGTCCGAA CCGATTACGC AACCTGATTT CCTGCCTTAT
AAGTATGAAA GCGGAACCCC CAATACACTG GGTATATGTG GGCTTAATGC CGGGTTGAAA
TTTGTTGCCA GTGAAGGTGT GGCTAACATA AGAAAGCATG AGCATGAATT GGCTTGCATG
CTTTACGAGG AATTGTCGAC TATTAAAGGA GTGACACTCT ATGGTCCCAA GAGCCCTGCT
GAGATAACAT CTATCGTAGC GTTCAATGTC AAAGATAAAA ATCCTATGAA GGTGGCAAAT
ACGCTGATTA CTAAGTTCGG AATAATCACC AGGCCTGGGT TACACTGTGC CCCTCTGACA
CATCAAACCG TCGGAACTTG GAAAGATGGT TCGGTCCGGA TCAGCGCCGG GTATTTTAAT
ACAAAAGAAC ATATTGATGA GGTGGTGAAG GCTGTTGCTG CCATCACTGC TACCTAA
 
Protein sequence
MLYINNASTS SPKPEAVYKA VELCIRTSGM SSDRSSFVSK LDFIPNETRA LIAKLINASD 
PTQIVFTMNG TEALNLAIKG ILKRGDHVIT TSLEHNSVIR PLKHLEQDGD IELSIVQASS
EGLLDPNDIV PLIKSNTKLI VTAHITNVLG TTIPIEEIGK IAAQHNIKYL VDAAQSIGFA
DIDVEKMNID MLAFPGHKSL FGPSGTGGLY IKKGIDLTPI KYGGTGNLSE PITQPDFLPY
KYESGTPNTL GICGLNAGLK FVASEGVANI RKHEHELACM LYEELSTIKG VTLYGPKSPA
EITSIVAFNV KDKNPMKVAN TLITKFGIIT RPGLHCAPLT HQTVGTWKDG SVRISAGYFN
TKEHIDEVVK AVAAITAT