Gene Information |
Locus tag | Cyan8802_4435 |
Symbol | |
ID | 8393787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4585083 |
End bp | 4586468 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644982344 |
Product | nitrate transporter, putative |
Protein accession | YP_003140055 |
Protein GI | 257062167 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
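The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 4585083
END_BP = 4586468


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1386 461"
```

Under these assumptions the computed values agree with the reported Gene Length (1386 bp) and Protein Length (461 aa).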
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.210676 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGTC GTCAATTTAT TAAATATACC AGTCTGGGAG TCGCTAGTTT TGGAATAGCC GCTTGTACCC AGGGTAATCT GGATGTTTTT CGTCCCCAAG GGACATTGTC TAACCCAGAT AAAGCGTTTG GACCCTTAGA AAAGACTAAC CTAACGCTGG GTTTTATGGC AACCACCGAT GCAGCCCCCC TGATTATTGC CCAAGAAAAG GGCTTTTTTG AGCGTTATGG ACTCATTGTT ACCCTGAAAC GTCAACCTTC TTGGGAAGCG ATTGAAAACG ATCTATTAGA ATGGCGTTTA GATGCTGCCC AGACTCCATT TACCTTACCG ATGGAAGTCC AATTAGGCAA AAAACAAGCT CCTTTGATCG CTTTGATGAA TCTTAACCTA AATGGCAGCG CAATTACTAT TACCCAAAAA GCCTGGAAAA AGGGTGTAAG ACCTTCTATT GATTACAATA ATTTTAGCGA TTTTGAAAGG GGATTTCGTC AATATATTAG AAATCATAAT CAATCAGTCC GTTGGGGCAT AGATAGTCCA GTTTCTATGG ATGCCTATCT ATTACGTTAT TGGTTAAGTG CCATGGGAAT TGATCCAGAT CGAGAAATTG AATTATTAGA ATTTCCTGCT GCTCAATTGA ATTATAAATT ACAAGCAGGA ATGCTTGAAG GATATATTGC ATCAGCCCCT TGGACTCAAA CAGCGTTGTC AGAAGAAGCG GGATTTATTA GTTATATTAG TCGAGATATT TGGCAAGGTC ATCCTAATAA AATTTTAGCA GCTATGGACG GTTGGGTTAG AAAAAATCCA GCAACTGCCA GGGCTTTAAT GGCTGCTTTA TTAGAAGCTT GTCAATATTG CGATCGCCAA GGGCGCGCTT TTAGTGATCG CCATCGCTCT CAAACAGAAA TCCCTGAGTT ATTAGCTCAA CCTCAATATT TAAACTTGGA TAGTTCTTTG ATTAAATCTA CTTTAGGGGA AACGTATTTA TACAGCAATG AAGAATCAAC TAAAACCGTT GTTGAGATTC CTGATTTTAT CATTTTCAAC TACCAAGATA CTCCCTATCT GCAAACCCCC GATCACGCTA ATTATCCTTG GCGTTCCCAT GCTATTTGGC TATTAACCCA GATGATTCGC TGGAACAAAA ATGACCTGAA AACCTATCCG AAAGACGCTG ATAAATTGCT CGATAAAATC TATCCTATTT CTTTGTATGA AGAAGTTGCC AAAACTTTAA ATATCCCTAT TCCAAATGAT ACTCTAAAAA CAGAGAAAGC AACTGCTTTT ATTGACGGAC GCTCCTTTGA TCCGAGTGAA CCTGTTGCCT ATCTTAATCA ATTTTCTATT CGCGCCAGTC GTCCTCAAAT TGTCAGCTTT GTTTAG
|
Protein sequence | MKRRQFIKYT SLGVASFGIA ACTQGNLDVF RPQGTLSNPD KAFGPLEKTN LTLGFMATTD AAPLIIAQEK GFFERYGLIV TLKRQPSWEA IENDLLEWRL DAAQTPFTLP MEVQLGKKQA PLIALMNLNL NGSAITITQK AWKKGVRPSI DYNNFSDFER GFRQYIRNHN QSVRWGIDSP VSMDAYLLRY WLSAMGIDPD REIELLEFPA AQLNYKLQAG MLEGYIASAP WTQTALSEEA GFISYISRDI WQGHPNKILA AMDGWVRKNP ATARALMAAL LEACQYCDRQ GRAFSDRHRS QTEIPELLAQ PQYLNLDSSL IKSTLGETYL YSNEESTKTV VEIPDFIIFN YQDTPYLQTP DHANYPWRSH AIWLLTQMIR WNKNDLKTYP KDADKLLDKI YPISLYEEVA KTLNIPIPND TLKTEKATAF IDGRSFDPSE PVAYLNQFSI RASRPQIVSF V
|
| |
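The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes the GC fraction, and translates a CDS using the amino-acid assignments of translation table 11. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the translation is simplified: it reads from the first base, stops at the first stop codon, and does not handle the alternative start codons that table 11 permits.

```python
BASES = "TCAG"
# Amino-acid assignments for translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and newlines, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate from the first base up to, but excluding, the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# The first two blocks of the gene sequence give the first six residues.
print(translate("ATGAAGCGTC GTCAATTTAT"))  # prints "MKRRQF"
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean_sequence` applied to the Protein sequence field is one way to confirm that the two fields are consistent; `gc_fraction` on the same input can be compared with the reported GC content.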