Gene PCC8801_4396 details


Gene Information

Locus tag: PCC8801_4396
Symbol:
ID: 7104844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4618594
End bp: 4619928
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 42%
IMG OID: 643477375
Product: nitrate transport protein
Protein accession: YP_002374474
Protein GI: 218249103
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
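
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names span_length and coding_codons are hypothetical helpers, not part of any database API.

```python
# Values copied from the record; the helper names are illustrative assumptions.
START_BP = 4618594
END_BP = 4619928
GENE_LENGTH_BP = 1335
PROTEIN_LENGTH_AA = 444


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from start to end equals the listed gene length, and dropping the stop codon gives the listed protein length.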


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAAAC TGTCTCGTCG CCAATTTATC GTGACTGCTG GTGCTGCTGC TGCTGGCACT 
GTGATCATTC ATGGTTGTTC TAGCGGTAGT GAAAATAATA CTACTCAATC TGGGTCTACT
CCTCAACCCC AGGCCAGTCC CGTCACTAAC CTCAGTCCCG AAGAAATGCC AGAGGTGACG
ACGGCCAAAC TCGGATTTAT TGCCTTAACC GACTCTACAC CCTTAATTAT TGCCAAAGAA
AAAGGACTCT TTGATAAGTA TGGGATGACT GGGGTAGAAG TCCTCAAACA AGCCTCTTGG
CCGGTTACTA GAGATAATTT GGAACTCGGT TCCGAGGGAG GTGGTATTGA TGGGGCTCAT
ATTTTAACCC CCATGCCTTA CTTGATGACC TTGGGTAAGA TTACAAAACA ACCTGTTCCC
ATGTATATTT TAGCCAGATT AAATGTTAAT GGCCAGGGAA TTTCTGTGAG TAAGGACTAT
CTCGATTTAA AAGTGAGTTT AGATAGTTCT AAAATGAAAG AAGTTTTTAG CAAAGCCAAG
GCTAATAAAA AAGAATTAAA TGCTGCCATG ACCTTCCCTG GAGGAACTCA CGATCTTTGG
TTACGCTATT GGTTAGCAGC CGGGGGAATT GACCCCGAAA AAGACATTTC AGTTATTCCT
GTACCCCCTC CTCAAATGGT TGCTAATATG AAAATTGGAG CCATGGAAAC CTTTTGTGTG
GGTGAACCTT GGAATGCTCA ATTAGTCAAT CAGAAGCAAG GTTATACTGC TTTAGTCACT
GGAGAATTGT GGAAAGATCA TCCTGAAAAA TCTTTTGCCT TACGCGCTGA TTGGGTGGAT
AAAAATCCCA AAGCTGCTAA AGCTTTACTC AAAGCGGTAT TAGAAGCACA ACAATGGTGT
GATAAGCCAG AAAATCATCA AGAAATGTGT GAAATTGTCG CTCAAGATAA GTGGTTTAAA
GTCCCCGTTG AAGACATTAT TGGCAGAATA CACGGCACAA TTGATTATGG TGATGGACGG
AAGGTAGAAA ATCCTGATAT TGCGATGAAG TTTTGGAAAG ATAATGCGTC TTATCCTTAT
AAGAGTCATG ATTTATGGTT CTTAACTGAA GATATGCGTT GGGGTTATAT TCCGGCTGAT
ACGGATACAA AAACCTTAGT TGATAAAGTC AATCGTTCTG ATTTATGGAA AGAAGCTGCT
AAAGCAATTA AAGTGGCTGA TGCGGAAATT CCCACCAGTG ATTCTCGTGG AGTTGAAACC
TTCTTTGATG GGGTGAAATT TGACCCCGCT AACCCCAAAG CTTATCTCGA TAGCCTGAAA
ATTAAGAAAG CTTAG
 
Protein sequence
MSKLSRRQFI VTAGAAAAGT VIIHGCSSGS ENNTTQSGST PQPQASPVTN LSPEEMPEVT 
TAKLGFIALT DSTPLIIAKE KGLFDKYGMT GVEVLKQASW PVTRDNLELG SEGGGIDGAH
ILTPMPYLMT LGKITKQPVP MYILARLNVN GQGISVSKDY LDLKVSLDSS KMKEVFSKAK
ANKKELNAAM TFPGGTHDLW LRYWLAAGGI DPEKDISVIP VPPPQMVANM KIGAMETFCV
GEPWNAQLVN QKQGYTALVT GELWKDHPEK SFALRADWVD KNPKAAKALL KAVLEAQQWC
DKPENHQEMC EIVAQDKWFK VPVEDIIGRI HGTIDYGDGR KVENPDIAMK FWKDNASYPY
KSHDLWFLTE DMRWGYIPAD TDTKTLVDKV NRSDLWKEAA KAIKVADAEI PTSDSRGVET
FFDGVKFDPA NPKAYLDSLK IKKA
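
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the tool the database used: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it translates only the first and last few codons copied from this record. The names CODON_TABLE, translate, and gc_fraction are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 bases of the gene sequence, copied from the record.
FIRST_BASES = "ATGTCTAAAC TGTCTCGTCG CCAATTTATC"
LAST_BASES = "ATTAAGAAAG CTTAG"

if __name__ == "__main__":
    print(translate(FIRST_BASES))  # compare with the start of the protein sequence
    print(translate(LAST_BASES))   # compare with the end of the protein sequence
```

Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field listed in the record.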