Gene Gobs_4122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4122 
Symbol 
ID8755813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4336585 
End bp4337919 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content72% 
IMG OID 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003411056 
Protein GI284992502 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCGCTC GTCTCCGCCG CGTCCCGTTC GCGGCGCAGG TCCTCCTCGC CCTCGTCCTG 
GGCGTCGCCC TCGGCCTCGT CGCCCGGGAC ATGGGTCCCG TCGCCGACGG GACGCCGAAC
TGGCTGACCA GCACCCTGCA GACCGTCGGC AGCACCTTCG TCACGCTGCT CAAGGTCCTC
GTCCCGCCGC TGGTCGTCAC CGCGGTGATC GTCAGCATCG CCAACCTCCG GCAGGTCTCC
AACGCCGCTC GCCTGGCCGG ACAGACGCTG CTGTGGTTCG CGATCACCGC CCTCATCGCG
GTCTCCCTCG GCATCGGCCT GGGCCTGCTC ACCCAGCCGG GCCGCAACAG CTCCGTCGAC
GCGGCCGCCC AGGCCGCGCC CGAGACCACC GGTTCCTGGT GGGACTTCCT CACCGGCCTG
GTGCCCGCCA ACATCCTCGG TCTGCAGTCC TCGGCCGAGG GCGGGCTGTC CTTCAACGTG
CTCCAGCTGA TCGTGCTGGC CGTGGTGATC GGCGTGGCCG TGCTCAAGGT CGGCGAGCCG
GCCGAGCCGT TCCTCGCGCT GACCCGCTCG GCGCTGACGA TCGTCCAGAA GCTGCTGTGG
TGGGTCATCC TCCTGGCCCC GATCGGCACC GTCGGCCTGA TCGGCAACGC GGTGGCCAGC
TACGGCTGGG AGTCCCTCGG CTCGCTCGGC ATCTTCGCCG GGTCCGTCTA CGCCGGCCTC
GCGCTCGTGC TGTTCGTCGT CTACCCGGTG CTGCTGCAGC TGCACGGACT CTCGCCGCTG
CGGTACTTCG CCGGCGCCTG GCCGGCCATC CAGCTGGCCT TCGTCTCCCG CTCCTCGATC
GGCACGCTGC CGGTGACCGA GCGGGTGACC GAGCAGAACC TCGGCGTACC GCGGTCCTAC
GCGTCCTTCG CCGTCCCGCT GGGCGCTACT ACGAAGATGG ACGGCTGCGC GGCGATCTAC
CCGGCGCTGG CCGCGATCTT CGTGGCGCAG TTCTTCGGCG TGGACCTCGG CCTCACCGAC
TACCTGCTGA TCGCGCTGGT CTCGGTCGTC GGCTCGGCGG CCACGGCCGG GGTCACCGGC
GCGGTGGTCA TGCTGACGCT GACCCTCTCC ACGCTGGGCC TGCCGCTGGC CGGCGTCGGC
CTGCTCCTGG CGATCGACCC GATCCTGGAC ATGGGGCGGA CGGCGGTGAA CGTCGCCGGC
CAGGCGCTGG TCCCGACGAT CGTCGCCAAG CGCGAGGGAA TCCTCGACGT GGACCGGTAC
CGGTCGACCT CGACGGTGGA CCCGGTGGCC CGGGTCGAGG TGGACGACGA CCTGCGCCGT
CCCGTGACGG TCTAG
 
Protein sequence
MLARLRRVPF AAQVLLALVL GVALGLVARD MGPVADGTPN WLTSTLQTVG STFVTLLKVL 
VPPLVVTAVI VSIANLRQVS NAARLAGQTL LWFAITALIA VSLGIGLGLL TQPGRNSSVD
AAAQAAPETT GSWWDFLTGL VPANILGLQS SAEGGLSFNV LQLIVLAVVI GVAVLKVGEP
AEPFLALTRS ALTIVQKLLW WVILLAPIGT VGLIGNAVAS YGWESLGSLG IFAGSVYAGL
ALVLFVVYPV LLQLHGLSPL RYFAGAWPAI QLAFVSRSSI GTLPVTERVT EQNLGVPRSY
ASFAVPLGAT TKMDGCAAIY PALAAIFVAQ FFGVDLGLTD YLLIALVSVV GSAATAGVTG
AVVMLTLTLS TLGLPLAGVG LLLAIDPILD MGRTAVNVAG QALVPTIVAK REGILDVDRY
RSTSTVDPVA RVEVDDDLRR PVTV