Gene Clim_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1543 
Symbol 
ID6354190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1663055 
End bp1664545 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content51% 
IMG OID642669149 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001943572 
Protein GI189347043 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAACC GAAACACCAC GAACACAACA AGAAAGCAAA GAACATTGCC GATCTACCAT 
CTTATCCCCG GAATTGGCCT TATCAGCAGG AAAAAACGAG GACTGGGCTT TTTTCTTTTT
TTCTCGTTTA TGATCTACCT GTTTATCCTG ATGCTGCAGG GAGACCTTGT CGTTATCAGT
TTCCGATCTC TGGTTTCCGC GACGTCGCTT ATTCTCCTTT CGCCTGAGAA ACTCCGCGAA
GTTTTCAGTA TCGAAATTAT TGAGTTCTGG ATTGCGTCCA TCGCTGTTAT CGCTGTTCCC
ATAGCACTCT TCATGGTTTC CAAAAAAACG GTTGCCGAAG AATCGCAACA AAAAGAAGGA
GAGAACCGGC ACGCATCGAG CCTTGGAAGA ATCAGTCTTC AGGCGTTCAT GCACAACACC
ATTGCCCAGA GCGCAGCCGT CGTGATCTTC GTACTCTATT CTGTGGCCTT TCTCGCGCCC
TTTCTCGCGC CCTTCAGCCC TTACGACCAG CAGGACTTTC TCGTTACTGC CTATAAACCG
CCATTTACGC GCTTGCAGGC ACTGGAACTC AAACAGCCGA AAATCCAGAG ACTGCCGGTA
CAGACTGCAA ACAACACTGC CGAAAAGCTC TCGAACTCCC TGATAAATGA TCTGCGAACA
TTGAAAACCC GTAACGAACC CCACGCACTG AAGTTCATCG ACGAATACCG CATCGAAAAC
GGGACGGTCT ATTATCGTCA GGGAATGCGT GCAAGAACCA TGTCGGTCAC AGAACTTGCT
GACAGGAAAG GCAAAGATCC GCTCATAACA AGAACATTTG CGCTTGGAAC CGACCAGTAT
GGACGCGATA TACTCAGCAG GGTAATCTAC GGCTCGCGAA TATCGCTCTC CATCGGCTTT
CTTGTCGTTA TGATCTCTGT AACCCTCGGC ACCGTTATAG GAATCTCATC CGGTTATTTC
GGCGGCTGGA TAGATGCTTT CCTCATGAGA ATTGTCGATG TGCTGATCGC TTTTCCTGCA
CTCTTCCTCA TTCTCATTAT CATCGCGACA TTCGGCAATT CCATTTACCT CATCGTCATT
ACGCTCTCGT TTACCGGCTG GATGGGGGTT GCACGCATCG TGAGAAGCCA GGTGCTCTCG
CTCAAGGAAC AGGAGTTTAT TCTGGCAGCA AAAGCTCTGG GCCTGTCGAG CATGAGAATC
ATCTTCCGCC ATCTCGCTCC GAACACCCTC ACACCCGTCA TCATTGCCGC AACACTCAGG
ATCGGCAGCA TCATTCTTAC AGAAGCCGGC CTCTCCTTCC TCGGTCTCGG TGTTCAGGCG
CCAACCCCGA GCTGGGGCAA CATCATCAAC GAAGGACGCG ACAGCCTTCT GAACCACTGG
TGGATCTCGA CGTTTCCCGG TATCGCCATT CTCACCACCG TTGTCTGTTT CAACCTCATC
GGAGATGGCG TCAGGGACGC CCTCGATCCC AGAATGCGAG GACATTCATG A
 
Protein sequence
MLNRNTTNTT RKQRTLPIYH LIPGIGLISR KKRGLGFFLF FSFMIYLFIL MLQGDLVVIS 
FRSLVSATSL ILLSPEKLRE VFSIEIIEFW IASIAVIAVP IALFMVSKKT VAEESQQKEG
ENRHASSLGR ISLQAFMHNT IAQSAAVVIF VLYSVAFLAP FLAPFSPYDQ QDFLVTAYKP
PFTRLQALEL KQPKIQRLPV QTANNTAEKL SNSLINDLRT LKTRNEPHAL KFIDEYRIEN
GTVYYRQGMR ARTMSVTELA DRKGKDPLIT RTFALGTDQY GRDILSRVIY GSRISLSIGF
LVVMISVTLG TVIGISSGYF GGWIDAFLMR IVDVLIAFPA LFLILIIIAT FGNSIYLIVI
TLSFTGWMGV ARIVRSQVLS LKEQEFILAA KALGLSSMRI IFRHLAPNTL TPVIIAATLR
IGSIILTEAG LSFLGLGVQA PTPSWGNIIN EGRDSLLNHW WISTFPGIAI LTTVVCFNLI
GDGVRDALDP RMRGHS