Gene Clim_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1000 
Symbol 
ID6355449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1089638 
End bp1091317 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content57% 
IMG OID642668624 
ProductNa+/Picotransporter 
Protein accessionYP_001943055 
Protein GI189346526 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACC GGCTTTTCTT CCTTCCCATA GCGATCTCGT TTATCGGAGG GCTTGCGCTT 
TTTCTCTATG GCATCAGGGT GATGAGCGGA GGGCTGAAAA AGGCTGCAGG AAGCCGCATG
CGGGATTTCA TTGCCCGTAT CACCGATAAC AGGTTCTCCG GACTGCTTGC CGGTGCGCTT
GCCACCATGA TGGTACAGTC GAGCAGCACC ATCATGGTGA TGCTTGTCGG GCTGGTGCAG
TCGCAATTGA TGACCTACGT TCAGGCGCTC TCCATCATTC TCGGAGCCGA GATCGGTACT
ACGGCGATGG CGCAGCTTAT CTCATTCAGG ATTCATGAGT ACGGCCTGCC GTTTTTTGCC
ACCGGTTTTG CGCTCAACCT TCTGAGCCGG AAGGAAGCGC TCCGCAACGC CGGCGAGGCT
TTCTCCGGAT TCGGACTTTT TCTTTTCGGT ATGAGCATCA TGTCCGGAGC CGTGGCACCG
CTCCGTTCGT ACGGGCCGTT TCTCGAACTG ATCGGGTATC TCGAGAATCC GTTGTACGGC
GTGCTTGCCG GCATGCTGCT GACCGGTCTG ATCCAGAGCA GCGGGGCGTT CATCGTCATT
GTGATCACGC TTGCCCAACA GGGTTCTCTT TCGCTCGAAG CAGGGATTCC GCTGCTTCTC
GGCTCGAATA TCGGCACCTG CATTACGGTT TCCCTTGCAA GTCTCGGCAT GGTACGTTCG
GCAAAGCGGG TGGCTCTCGC CCAGGTGCTG TTCAATGTGT CCGGAGTGGC TGTTTTTCTT
TTTCTGATTC CCTGGTATGC CGATCTCGTG CGTATGATCT CTCCGTCCGA GGGAGTTCCG
GGAACTGTCA TTCCCCGGCA GATCGCCAAT GCGCATACGC TCTACAACGT GTTCATGGCC
GTCATGTTCC TGCCCTTGAT ACCGTTCTGG GCGAAACTGC TCATCCGCCT GATTCCCGAC
AGTCCCGAAG AAACCCGTCT GCAGCCTTCG GTCTGGTATA TTACCGGGAC GGCGCTCTCT
ACCCCTTCGC TTGCCCTGAG TTATGCAAGG GCCGAAACCT CCAGAATGAA CCGGATACTC
GAACGGATGG TTGGAGCTTC ACTTCCGGCC TTTACCGGCA GCGTAAAGGC AAAGGATACG
GTATTTCCGG ATCTTTCCGT TATCGGGGGA ATCAGAATGC GGGAGGAGAA GATCGATTTT
CTCGAATCGA AGGTTTCCGA CTATCTCATT GCCATCAGCC GGCAGGAGCT CGGGGAGCGG
GAGTCTCAGG AGGTGTTCGC GCTCATGACC ATCGTCAAGG ATCAGGAATC GATAGGGGAC
TGCATCGAGG CGCTTCTGCA GAAATTGCCG GAACGAACAG CGGAAAGCGC ATCCGGTCTG
ACGGCCGAAG GTATGGCAGA CCTGACGGCC CTCCATGCAT TTCTCTGCGG CGAAGTTGCC
GCGCTTACCG TCGCCGTTCA GGAGATGAGC GGCTCCAGGG CTTCGGGAAT CCTGCACGGC
GCCGTCGATT TTCCCGTACT TGCCGGCGCG GCGGAAGCCC GGCACCTGCA GCGGCTGCGC
ACGCTTCCCG AATCGGCTAT GACGCACGAT ATGCACATGG AGCTGCTGCA CGCGTTTGAA
GAGATGCACC ACTACTGCAA GAGTGTCGCG AGGAGTATCG TGAATGCGGA GGGACAGTAG
 
Protein sequence
MENRLFFLPI AISFIGGLAL FLYGIRVMSG GLKKAAGSRM RDFIARITDN RFSGLLAGAL 
ATMMVQSSST IMVMLVGLVQ SQLMTYVQAL SIILGAEIGT TAMAQLISFR IHEYGLPFFA
TGFALNLLSR KEALRNAGEA FSGFGLFLFG MSIMSGAVAP LRSYGPFLEL IGYLENPLYG
VLAGMLLTGL IQSSGAFIVI VITLAQQGSL SLEAGIPLLL GSNIGTCITV SLASLGMVRS
AKRVALAQVL FNVSGVAVFL FLIPWYADLV RMISPSEGVP GTVIPRQIAN AHTLYNVFMA
VMFLPLIPFW AKLLIRLIPD SPEETRLQPS VWYITGTALS TPSLALSYAR AETSRMNRIL
ERMVGASLPA FTGSVKAKDT VFPDLSVIGG IRMREEKIDF LESKVSDYLI AISRQELGER
ESQEVFALMT IVKDQESIGD CIEALLQKLP ERTAESASGL TAEGMADLTA LHAFLCGEVA
ALTVAVQEMS GSRASGILHG AVDFPVLAGA AEARHLQRLR TLPESAMTHD MHMELLHAFE
EMHHYCKSVA RSIVNAEGQ