Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3730 |
Symbol | |
ID | 8393078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3817695 |
End bp | 3819098 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644981661 |
Product | nitrate transporter |
Protein accession | YP_003139377 |
Protein GI | 257061489 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0457947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.12705 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACA AAAATTGGAC TAGACGGCAA GCCCTATTAG GGTTAGGGGG TCTTGCGGGG GCTGTGGCCT TTTCTTCGTG TGGCATTAAT ACAAACCGCG CCCCAAAAAG CCTGACAGAA GCTGCCCTTG CCGTTGATCA AGTCGTTAAA CCCGAAACCC TTGAAAAACC CAATCTCAAA ATTGGTTATG TTCCCGTCAA TGACTGTGCC CCCTTTGCCA TTGCTTGGGA AAAAGGCTTC TTTCACAAGT ATGGTTTAAA CGTCACCCTC AGTCGTGAGG CTAGTTGGGC AAACTCCCGT GATGGGGTAA TCTTTGGTCG TTTGGATGCG TCTCCTGTTG TTTCTGGGGC GGTTACTAAT GCTAGAATTG GGGCTGAAGG AGCCCGTCGA GCCCCTCTGT GTGCAGCCAT GACTATTCAT CGTCACGGCA ACGCCATGAC CATGAATCAA GGACTGTGGG ATGGAGGGAT TCGTCCTTGG AAAGAATATA AAGGAGATTT AGACGCATTT GGTCGAGATT TTAAGGACTA TTTTGCAAAA GCTCCGTCAG ACAAACGGGT GTGGGCGGTA GTACTCAGTT CAGCTATTTA CGAATACTTT ACCCGTTATG TAGTGGCAGC CGCTGGACTC AATCCTACTG AGGAATTTCG GATCATTATC ACTCCCCCTC CGCAAATGGT CAGTAATATG CGAATTGGCG CGATGCAAGC CTATATGGTG GCCGAACCCT GGAATACTCG CGCTATTTCG GGGAACGAGG GGATAGGCTT TACCTTTGCC CAAGGACGAG AAATCTGGCG GGGACATCCT GACCGAGTGT TGGCGGTAAC GGAGTCCTTT ATCGAGGAAA ATCCCAAAAC CTATCGATCG CTGGTGAAAG CCTTAATTGA AGCCTGTCAG TATTGCAGTA AGCCCGAAAA CCGCGAAGAA GTGGCTAAAA TTATCTCGAC TCGTCCCTTT ACGGGGGCAA AACCTCAATA CACACGACCT GGAATAGTTG GAGATTACAA CTACGGAGGA TTTGATGAGC AAAAACGGGT GGTTAATAGT CCAGAAACGA CGATTTTTTA CAATCTTCCT GAGGGAGTTT CTGCTGTTCC TCACGATCAT TCGACTTTTC TCTGGCAATC TCAAAGTCTC TGGTTAATGA CCCAAGCCAC TCGATGGCAA CAGATCCGAG AATTTCCCAA AAATGCTGAA AAAATTGCTC GTCAGGGTTG GAAAACGGAT TTGTATCGAG AAATTGCTGC TGAAATGGGG ATTAAATGTC CTTCCCAGGA TTACAAAGTT GAACCCGCCG AGGCTTTTAT TGATAATAAA GCCTTTGATC CGAGTGATCC GATTAACTAT CTCAACAGCT TTGAAATTCG CGCCAACGCG CCTCAATCTT TCTTCATGTC TTAA
|
Protein sequence | MNNKNWTRRQ ALLGLGGLAG AVAFSSCGIN TNRAPKSLTE AALAVDQVVK PETLEKPNLK IGYVPVNDCA PFAIAWEKGF FHKYGLNVTL SREASWANSR DGVIFGRLDA SPVVSGAVTN ARIGAEGARR APLCAAMTIH RHGNAMTMNQ GLWDGGIRPW KEYKGDLDAF GRDFKDYFAK APSDKRVWAV VLSSAIYEYF TRYVVAAAGL NPTEEFRIII TPPPQMVSNM RIGAMQAYMV AEPWNTRAIS GNEGIGFTFA QGREIWRGHP DRVLAVTESF IEENPKTYRS LVKALIEACQ YCSKPENREE VAKIISTRPF TGAKPQYTRP GIVGDYNYGG FDEQKRVVNS PETTIFYNLP EGVSAVPHDH STFLWQSQSL WLMTQATRWQ QIREFPKNAE KIARQGWKTD LYREIAAEMG IKCPSQDYKV EPAEAFIDNK AFDPSDPINY LNSFEIRANA PQSFFMS
|
| |