Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0875 |
Symbol | |
ID | 3681737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1070286 |
End bp | 1071728 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637716209 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_321394 |
Protein GI | 75907098 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0516063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00230539 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAATC ATAACTGGAC TAGAAGAGAT TTTATCAAAG GCGTAGGAGC CACCACCGCC GGCATTACCC TATCTGCCTG TAACCCTTCA GGAGATAGAT CCGCCACAGG CCTAACCCAA GAAGCCCTAA CCATCAAGCC AGTAATCAAA TCCAGCGATT TAGAAAAACC CGATTTGATT GTGGGATACG TTCCTGTAAA TGATTGTGCG CCATTTGCGA TCGCCTGGAA AAAAGGCTTT TTTCGCAAGT ATGGCTTAAA CGTCCAACTC AACCGCGAAG CCAGTTGGGC TACCTCCCGC GATGGCTTAA TTTTTGGTCG TCTTGATGCT GCACCTGTAG TATCTGGTGC AGTCACCAAC GCCCGTATAG GTGCAGAAGG CGCACGTCAC GCGCCCTTAT GTGCAGCCAT GACAATCCAT CGTCACGGTA ACGCCATGAC CATGAACAAA GCCATGTGGG ATTTTGGCTT GCGTCCTTGG TATGAATATC AAGAAAAATA TGGCGATGGT GCATTAGAAG CCTTTGGACG GGACTTTCGG GGTTACTTTG AGAAACAACC ACCAGAGAAC AAAGTTTGGG CTGTAGTTTT AAGTTCGGCA ATTTACGAAT ACTTCGTCCG TTATGTATCG GCTGCGGCTG GTGTTGATCC CCTCAAAGAA TTTCGCGTGA TTATTGTTCC ACCACCCCAG ATGGTGACCA ATGTACGAAT AGGGGCAATG CAAGCATACA TGGTAGCAGA ACCTTGGAAT ACCAGAGCAA TCACAGGTAA CGAAGGCATT GGTTTTACTT TTGCCCAAGG TAAAGAAGTC TGGCTGGGAC ACCCAGATAG ATTATTGGGA GTAATGGAGT CTTTTATTGA TCAATACCCC AAAACCTACC GTTCCTTGGT CAAAGCCATG ATTGAAGCTT GCCAATATTG CAGTAAACCG GAAAATCGCC AAGAAGTCGC TGAACTAATT ACAGACCGTT CCTTCACAGG TGCAAGACCG AAAAATAAAA ATTTACCAAT CACTAAATTG ACCGCACCAG GAATTATCGG TTCATACAAC TATGGCGGAT TTGATGGCAA AGACCGCACC ATTCCCGCCG CAGACACGAC AATTTTCTAC GACATTCCCG ACAACCTGCC CAAACAACCA GCCGAACACT CTACATTTTT ATGGAGATCC AGAAGCCTTT GGTTAATGAC TCAAGCCGCC CGTTGGGGAC AAATCAAAGA ATTTCCCAAA AATGCTGAAC AATTAGCCGA AAAAGGCTGG AGAACAGATT TATATCGCCA GATAGCCGCA GAAATGGGAA TTCAATGTCC CCAGGATGAT TACAAAGTTG AGCCACCGGA AGTATTTATA GATAAGAAAG GTTTTGACCC CAGTGACCCT GTTGGCTATT TGAATAGTTT TGCAATTAGG GCTAATGCGC CCACTCGTTT TTTCCTGTCT TGA
|
Protein sequence | MSNHNWTRRD FIKGVGATTA GITLSACNPS GDRSATGLTQ EALTIKPVIK SSDLEKPDLI VGYVPVNDCA PFAIAWKKGF FRKYGLNVQL NREASWATSR DGLIFGRLDA APVVSGAVTN ARIGAEGARH APLCAAMTIH RHGNAMTMNK AMWDFGLRPW YEYQEKYGDG ALEAFGRDFR GYFEKQPPEN KVWAVVLSSA IYEYFVRYVS AAAGVDPLKE FRVIIVPPPQ MVTNVRIGAM QAYMVAEPWN TRAITGNEGI GFTFAQGKEV WLGHPDRLLG VMESFIDQYP KTYRSLVKAM IEACQYCSKP ENRQEVAELI TDRSFTGARP KNKNLPITKL TAPGIIGSYN YGGFDGKDRT IPAADTTIFY DIPDNLPKQP AEHSTFLWRS RSLWLMTQAA RWGQIKEFPK NAEQLAEKGW RTDLYRQIAA EMGIQCPQDD YKVEPPEVFI DKKGFDPSDP VGYLNSFAIR ANAPTRFFLS
|
| |