Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4396 |
Symbol | |
ID | 7104844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4618594 |
End bp | 4619928 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643477375 |
Product | nitrate transport protein |
Protein accession | YP_002374474 |
Protein GI | 218249103 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAAAC TGTCTCGTCG CCAATTTATC GTGACTGCTG GTGCTGCTGC TGCTGGCACT GTGATCATTC ATGGTTGTTC TAGCGGTAGT GAAAATAATA CTACTCAATC TGGGTCTACT CCTCAACCCC AGGCCAGTCC CGTCACTAAC CTCAGTCCCG AAGAAATGCC AGAGGTGACG ACGGCCAAAC TCGGATTTAT TGCCTTAACC GACTCTACAC CCTTAATTAT TGCCAAAGAA AAAGGACTCT TTGATAAGTA TGGGATGACT GGGGTAGAAG TCCTCAAACA AGCCTCTTGG CCGGTTACTA GAGATAATTT GGAACTCGGT TCCGAGGGAG GTGGTATTGA TGGGGCTCAT ATTTTAACCC CCATGCCTTA CTTGATGACC TTGGGTAAGA TTACAAAACA ACCTGTTCCC ATGTATATTT TAGCCAGATT AAATGTTAAT GGCCAGGGAA TTTCTGTGAG TAAGGACTAT CTCGATTTAA AAGTGAGTTT AGATAGTTCT AAAATGAAAG AAGTTTTTAG CAAAGCCAAG GCTAATAAAA AAGAATTAAA TGCTGCCATG ACCTTCCCTG GAGGAACTCA CGATCTTTGG TTACGCTATT GGTTAGCAGC CGGGGGAATT GACCCCGAAA AAGACATTTC AGTTATTCCT GTACCCCCTC CTCAAATGGT TGCTAATATG AAAATTGGAG CCATGGAAAC CTTTTGTGTG GGTGAACCTT GGAATGCTCA ATTAGTCAAT CAGAAGCAAG GTTATACTGC TTTAGTCACT GGAGAATTGT GGAAAGATCA TCCTGAAAAA TCTTTTGCCT TACGCGCTGA TTGGGTGGAT AAAAATCCCA AAGCTGCTAA AGCTTTACTC AAAGCGGTAT TAGAAGCACA ACAATGGTGT GATAAGCCAG AAAATCATCA AGAAATGTGT GAAATTGTCG CTCAAGATAA GTGGTTTAAA GTCCCCGTTG AAGACATTAT TGGCAGAATA CACGGCACAA TTGATTATGG TGATGGACGG AAGGTAGAAA ATCCTGATAT TGCGATGAAG TTTTGGAAAG ATAATGCGTC TTATCCTTAT AAGAGTCATG ATTTATGGTT CTTAACTGAA GATATGCGTT GGGGTTATAT TCCGGCTGAT ACGGATACAA AAACCTTAGT TGATAAAGTC AATCGTTCTG ATTTATGGAA AGAAGCTGCT AAAGCAATTA AAGTGGCTGA TGCGGAAATT CCCACCAGTG ATTCTCGTGG AGTTGAAACC TTCTTTGATG GGGTGAAATT TGACCCCGCT AACCCCAAAG CTTATCTCGA TAGCCTGAAA ATTAAGAAAG CTTAG
|
Protein sequence | MSKLSRRQFI VTAGAAAAGT VIIHGCSSGS ENNTTQSGST PQPQASPVTN LSPEEMPEVT TAKLGFIALT DSTPLIIAKE KGLFDKYGMT GVEVLKQASW PVTRDNLELG SEGGGIDGAH ILTPMPYLMT LGKITKQPVP MYILARLNVN GQGISVSKDY LDLKVSLDSS KMKEVFSKAK ANKKELNAAM TFPGGTHDLW LRYWLAAGGI DPEKDISVIP VPPPQMVANM KIGAMETFCV GEPWNAQLVN QKQGYTALVT GELWKDHPEK SFALRADWVD KNPKAAKALL KAVLEAQQWC DKPENHQEMC EIVAQDKWFK VPVEDIIGRI HGTIDYGDGR KVENPDIAMK FWKDNASYPY KSHDLWFLTE DMRWGYIPAD TDTKTLVDKV NRSDLWKEAA KAIKVADAEI PTSDSRGVET FFDGVKFDPA NPKAYLDSLK IKKA
|
| |