Gene PCC8801_4414 details

Gene Information

Locus tag: PCC8801_4414
Symbol:
ID: 7104860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4640364
End bp: 4642394
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 45%
IMG OID: 643477393
Product: TrkA-N domain protein
Protein accession: YP_002374492
Protein GI: 218249121
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0569] K+ transport systems, NAD-binding component
TIGRFAM ID:
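
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4640364
END_BP = 4642394
GENE_LENGTH_BP = 2031
PROTEIN_LENGTH_AA = 676


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length)     # 2031
print(computed_protein_length)  # 676
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.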


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGAACCCA GTGGAAACCC ATCTCCACAC CGTAATCTGC CTCAACCAGA CAGCGATCGC 
TTTTTAGTCT GTGGGTTAGG CAGTTTAGGA CAACATTGTG TCCTCTCCCT CAAGGAATTT
GGGGTAAAAG TAACAGCCAT AGAACAAATT GAACCGAAAA CCTGGGAAAT TCCTAATATC
CCTGAATTAC TCGACGATTT AATCATTGCC GACTGTAGAC AAAATCAGAT TTTACAACAG
GCAAAAATAG AACGCTGTCG CGCCGCCTTA TTAGTCACCA CCAACGAACA AGCGAACATC
GAAACCGCCT TAGCCATTCG TCAACTGAAC CCCCACACCC GCTTAATTGT TCGTTCTCCC
AAAGAAAACC TCAATCAACT GTTGAGTGAA CAACTGGGGA ACTTTATCGC CTATGAACCC
ACCCAACTAC CGGCCGCTGC CTTTGCGATC GCCGCGTTAG GAACACAAAC CCGTGGCTTT
TTTAGCCTTG ATCGCCAACA ATTGCGGGTC ATTCAGCGTC GTTTGACCCC CAATGATCCT
TGGTGTCATG TCCGTCCCTT ACATGACCTC AATACCCGAA ACCGTCGCTT AATCGGGTAT
CATGACGGAG AAGATCATCC CTCGGTGAGT TTCTACTATT GGGACCCAGA TACCGTTGTT
AAACCTGGAG ATCAACTGAT CTACATTGAA ACAACCGACA CCCTACTACA ACCTGTTTCA
ATGTCTTCGG TTCAATCTTC CCAACACCCT CAAAGACAAT TTTGGCAAAA TTGGCGCGAA
CGTCTCAAAA AGTTATGGCA AAAAGGACGA CAACGCATCC GACAAATTGC CTTGATCAGT
GGTTTAATTG TGATTGTTCT GTTAATTATT GGAACGTTGT TATTGCATTG GAATTTCCCC
CAAAGTACCT TACTTTCGGC TTTTTCTGCC ACGGCAATTT TATTATTAGG TGGATATTCT
GATCTGTTTG GAGAATTTGA ACAAATGGAT GATATTCCTC CCTGGTTGCA GTTATTTAGT
TTGGGGTTAA CCTTAGCAGG AACAGCTTTT GTCGGGGTTT TATATGCCTT ACTCACAGAA
ACCCTTTTAT CAGCGCAATT TCAGTTTGTT AAACAGCGTC CCCCCATTCC CCAAGCCAAT
CATATCCTCA TCATAGGACT AGGAAGAGTC GGCCAACAAG TGGCTGAGTT TCTATTGGAA
TTGAAACAAA CGTTGTTAGG AATTACCTTT AATTTAGAGT TAGATTCGAC TATTTTGCCA
GAAATGCCCC TCATTGTGGG GAATGTTCAA AACGTCCTGC CTCAAGCCAA TTTAGCCACC
GCTAAGAGTG TTGTTGTGGT GACGGATGAT GAAATTCTCA ACCTAGAAGT CGCCTTAATG
TCCCAAAAAC TGAACCCTGA CAGTCACATC GTCATTCGGA CAGCCGGACA AGCGTTAGGA
CAGCATTTAT TGCCCATTTT GCCAAAAGCC CAAATTTTGG GAACCTATGC GGTGGCCGCA
GAAGTGTTTG CCGGGGCAGC TTTCGGGGAA AATATTATTA CAGTCTTTCG CCTCAATAAT
CGGACGGTGT TGGTGACAGA ATACGAAGTT GAGGAAGAGG ATACCCTCAA TGGCTTGTTA
TTGGCAGAGA TTGCCTATGG CTATGGAGTT CTTCCTATCT TGCATCAAAA GCCCCCTAAT
GCCTCAAATT TGATGCCCTC CGATGATATT CGGTTGGGGG TAGGCGATCG CTTGGTCGTC
TTAGCTACCA TTGAAGACTT AAAGCGAGTT GAACAGGGAA AAATTGCCAT TCAACCCAAA
CAATGGCGCA TTAGAGTTGA AAAGGCGTTT AACGATGAGG CTGCTTTTGA GGGGGCGAAT
GCGATCGCTC GTATTTCGGG TTGTTCTTTG AATATAGCAC GAACACTCAT GGAACAGTTA
CCCGCGACTT TATCGGTTCC CCTTTATCAC CATCAAGGGT TACGATTAGT ACGCGAATTG
CATAAATTAC GGGTGACGTC AGCATTAATT CCGATTCAAG TAAGTCGTTA A
 
Protein sequence
MEPSGNPSPH RNLPQPDSDR FLVCGLGSLG QHCVLSLKEF GVKVTAIEQI EPKTWEIPNI 
PELLDDLIIA DCRQNQILQQ AKIERCRAAL LVTTNEQANI ETALAIRQLN PHTRLIVRSP
KENLNQLLSE QLGNFIAYEP TQLPAAAFAI AALGTQTRGF FSLDRQQLRV IQRRLTPNDP
WCHVRPLHDL NTRNRRLIGY HDGEDHPSVS FYYWDPDTVV KPGDQLIYIE TTDTLLQPVS
MSSVQSSQHP QRQFWQNWRE RLKKLWQKGR QRIRQIALIS GLIVIVLLII GTLLLHWNFP
QSTLLSAFSA TAILLLGGYS DLFGEFEQMD DIPPWLQLFS LGLTLAGTAF VGVLYALLTE
TLLSAQFQFV KQRPPIPQAN HILIIGLGRV GQQVAEFLLE LKQTLLGITF NLELDSTILP
EMPLIVGNVQ NVLPQANLAT AKSVVVVTDD EILNLEVALM SQKLNPDSHI VIRTAGQALG
QHLLPILPKA QILGTYAVAA EVFAGAAFGE NIITVFRLNN RTVLVTEYEV EEEDTLNGLL
LAEIAYGYGV LPILHQKPPN ASNLMPSDDI RLGVGDRLVV LATIEDLKRV EQGKIAIQPK
QWRIRVEKAF NDEAAFEGAN AIARISGCSL NIARTLMEQL PATLSVPLYH HQGLRLVREL
HKLRVTSALI PIQVSR
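
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The sketch below is a minimal illustration, not the method used to produce this record: it builds the codon-to-residue mapping in the NCBI TCAG ordering (translation table 11 assigns the same residues to codons as the standard table and differs only in permitted start codons), then translates the first 60 and the last 12 nucleotides of the gene sequence. The `translate` helper is a hypothetical name introduced here.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First line of the gene sequence (60 nt) and its final 12 nt, copied from the record.
FIRST_60 = "ATGGAACCCA GTGGAAACCC ATCTCCACAC CGTAATCTGC CTCAACCAGA CAGCGATCGC"
LAST_12 = "GTAAGTCGTTAA"

print(translate(FIRST_60))  # MEPSGNPSPHRNLPQPDSDR
print(translate(LAST_12))   # VSR*
```

The two outputs match the first 20 residues and the last three residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.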