Gene Cyan7425_3994 details


Gene Information

Locus tag: Cyan7425_3994
Symbol:
ID: 7289941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7425
Kingdom: Bacteria
Replicon accession: NC_011884
Strand:
Start bp: 4027425
End bp: 4029050
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 54%
IMG OID: 643586965
Product: extracellular solute-binding protein family 5
Protein accession: YP_002484669
Protein GI: 220909358
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
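
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4027425
END_BP = 4029050


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # 1626 bp
print(protein_length(length_bp), "aa")  # 541 aa
```

Both results agree with the Gene Length (1626 bp) and Protein Length (541 aa) values listed in the record.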


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000169157
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATTAAATTCC TGGGTTTAGC CCTGTTTTGT GCTGTGTTGC TGCTGGGCTG TAGTCCTTCC 
TCCTCTCCCC CCCCCTCTAG TGGGCGAGTC ACCCTCGGCA CCACTGCTAG AGTTCGCACC
CTTGATCCGG CAGATGCCTA CGAAGTGTTT GCTGGCACGT TGCTTTATAA CCTGGGCGAT
CGCCTCTATA CCTACAAAGG CACCACCCTG GTTCCCCAGC TAGCCACGGC TTTACCAATT
GTGAGTGAAG ATGGGCTGAC CTATCGCATC CCATTGCGCC AGGGTGTTCT CTTCCACGAT
GGCACCCGCT TCGATGCCAG GGCCATGGTC TTTTCGTTAG AGCGGTTTAT CAAAAATGAG
GGCCAGCCCT CAGCCCTGTT GGCGGGACGG GTGGAATCGA TCCAAGCGAC GGGGGAATAT
GAGCTGGAAA TTCGCTTAAA AAAACCGTTT GTTGCTTTTC CGGCCTTGCT GGCTTTCAGT
GGGCTGTGTG CCGTTTCCCC CAAAACCTAT GCGATCGGAG CCGGCCAATT TCTTCCAACC
CAATTTATCG GGACGGGCCC CTACAAACTC GTGCAACTGC GCAGTGATGC GATCCGCTTA
GAACCCTTTG CTGACTACTG GGGAACCAAA CCCGTCAATC AGGGGGTAGA TATTCAGGTC
TTTTCCAGTG GGGCCAACCT GTTTAATGCC TTTCGCACCG GAGCGGTGGA CATTGCCACC
CAATCGCTCG ATCCGAATCA GATTCAGGCG TTAATCCAGG GCAGTCAAAC CAAGGGCTGG
CAGGCGATCG CGGGTTCCAG TAACTCTATT ACTGTGCTGA CCTTAAATAC CCGTCAAGCC
CCCTGGGATC AGTTAGCCAC CCGTCAAGCC CTGGCTGCCC TGATCAATCG TCAGATTCTC
CAAAACCGTG TTTTTCAGGG GCAGGCCGAT CGCCTATTCA GCTTGATTCC CACCATTTTC
ACCGTCAGCC AGCCGGTCTT TCAGACCCAG TACGGGGATG GGCAGATTGA AACGGGGAAA
GAATTTCTCA GCCGGGCAGG GTATTCCGCA GCTCAACCCC TGAAAATCAA TCTCTGGTAT
CGCTCCAACG TCCCCAGCAA TGTTTTGGCA GCCACGGTGT TAAAGGCGGC AATCGAACGG
GATTGGGGCG AACTGGCAGC GGTAGAACTC AGCGGGGTGG AATCGGCAAC GGCTTATCAA
AATCTGGACA AGGGTGTTTA TCCTCTGATG ATGCTGGATT GGTACGGGGA TTTCTACGAC
CCAGATAACT ATATTGAACC GTTCCTGGCC TGCGAACAGG GCTCTGTCCA AACAGGGTGT
GAAGCCGGTG CCAGTGCCTC CTGGGGGTCT TTCTTCTATA GCAATCAAGC CAATCAATTG
ATCGATCAGC AACGTCGCCA GGCCGATCCC GCCGAACGCC AGCAACTGTT TGCCCAACTC
CAGGGAATTC TGGTTCAGAA TGTCCCATTT ATCCCCCTCT GGCAAAGTAA GAGTTATGTA
TTTGCCCAGA AGGAAATTCA AGGGGTGCAA TTGGAACCAA CCCAGCAGTT TCTCTTAACC
AGCATCAGCA AGTCAGGCAT CAGCAAGTCA GGCATCAGCA AGTCAGCGGG CCGCTCCAGC
CAGTAA
 
Protein sequence
MKFLGLALFC AVLLLGCSPS SSPPPSSGRV TLGTTARVRT LDPADAYEVF AGTLLYNLGD 
RLYTYKGTTL VPQLATALPI VSEDGLTYRI PLRQGVLFHD GTRFDARAMV FSLERFIKNE
GQPSALLAGR VESIQATGEY ELEIRLKKPF VAFPALLAFS GLCAVSPKTY AIGAGQFLPT
QFIGTGPYKL VQLRSDAIRL EPFADYWGTK PVNQGVDIQV FSSGANLFNA FRTGAVDIAT
QSLDPNQIQA LIQGSQTKGW QAIAGSSNSI TVLTLNTRQA PWDQLATRQA LAALINRQIL
QNRVFQGQAD RLFSLIPTIF TVSQPVFQTQ YGDGQIETGK EFLSRAGYSA AQPLKINLWY
RSNVPSNVLA ATVLKAAIER DWGELAAVEL SGVESATAYQ NLDKGVYPLM MLDWYGDFYD
PDNYIEPFLA CEQGSVQTGC EAGASASWGS FFYSNQANQL IDQQRRQADP AERQQLFAQL
QGILVQNVPF IPLWQSKSYV FAQKEIQGVQ LEPTQQFLLT SISKSGISKS GISKSAGRSS
Q
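
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is only a sketch under stated assumptions: the codon assignments are written out by hand in the usual TCAG order (translation table 11 uses the same codon-to-amino-acid assignments as the standard code), the first codon is always rendered as M because it is the initiator, and the helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    bases = clean(seq)
    residues = []
    for index in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon is shown as M whatever its identity.
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATTAAATTCC TGGGTTTAGC CCTGTTTTGT"))  # MKFLGLALFC
```

Pasting the full gene sequence into `translate` and `gc_percent` would allow a comparison with the listed protein sequence and the GC content field.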