Gene Ava_0875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0875 
Symbol 
ID3681737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1070286 
End bp1071728 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content46% 
IMG OID637716209 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_321394 
Protein GI75907098 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0516063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00230539 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAATC ATAACTGGAC TAGAAGAGAT TTTATCAAAG GCGTAGGAGC CACCACCGCC 
GGCATTACCC TATCTGCCTG TAACCCTTCA GGAGATAGAT CCGCCACAGG CCTAACCCAA
GAAGCCCTAA CCATCAAGCC AGTAATCAAA TCCAGCGATT TAGAAAAACC CGATTTGATT
GTGGGATACG TTCCTGTAAA TGATTGTGCG CCATTTGCGA TCGCCTGGAA AAAAGGCTTT
TTTCGCAAGT ATGGCTTAAA CGTCCAACTC AACCGCGAAG CCAGTTGGGC TACCTCCCGC
GATGGCTTAA TTTTTGGTCG TCTTGATGCT GCACCTGTAG TATCTGGTGC AGTCACCAAC
GCCCGTATAG GTGCAGAAGG CGCACGTCAC GCGCCCTTAT GTGCAGCCAT GACAATCCAT
CGTCACGGTA ACGCCATGAC CATGAACAAA GCCATGTGGG ATTTTGGCTT GCGTCCTTGG
TATGAATATC AAGAAAAATA TGGCGATGGT GCATTAGAAG CCTTTGGACG GGACTTTCGG
GGTTACTTTG AGAAACAACC ACCAGAGAAC AAAGTTTGGG CTGTAGTTTT AAGTTCGGCA
ATTTACGAAT ACTTCGTCCG TTATGTATCG GCTGCGGCTG GTGTTGATCC CCTCAAAGAA
TTTCGCGTGA TTATTGTTCC ACCACCCCAG ATGGTGACCA ATGTACGAAT AGGGGCAATG
CAAGCATACA TGGTAGCAGA ACCTTGGAAT ACCAGAGCAA TCACAGGTAA CGAAGGCATT
GGTTTTACTT TTGCCCAAGG TAAAGAAGTC TGGCTGGGAC ACCCAGATAG ATTATTGGGA
GTAATGGAGT CTTTTATTGA TCAATACCCC AAAACCTACC GTTCCTTGGT CAAAGCCATG
ATTGAAGCTT GCCAATATTG CAGTAAACCG GAAAATCGCC AAGAAGTCGC TGAACTAATT
ACAGACCGTT CCTTCACAGG TGCAAGACCG AAAAATAAAA ATTTACCAAT CACTAAATTG
ACCGCACCAG GAATTATCGG TTCATACAAC TATGGCGGAT TTGATGGCAA AGACCGCACC
ATTCCCGCCG CAGACACGAC AATTTTCTAC GACATTCCCG ACAACCTGCC CAAACAACCA
GCCGAACACT CTACATTTTT ATGGAGATCC AGAAGCCTTT GGTTAATGAC TCAAGCCGCC
CGTTGGGGAC AAATCAAAGA ATTTCCCAAA AATGCTGAAC AATTAGCCGA AAAAGGCTGG
AGAACAGATT TATATCGCCA GATAGCCGCA GAAATGGGAA TTCAATGTCC CCAGGATGAT
TACAAAGTTG AGCCACCGGA AGTATTTATA GATAAGAAAG GTTTTGACCC CAGTGACCCT
GTTGGCTATT TGAATAGTTT TGCAATTAGG GCTAATGCGC CCACTCGTTT TTTCCTGTCT
TGA
 
Protein sequence
MSNHNWTRRD FIKGVGATTA GITLSACNPS GDRSATGLTQ EALTIKPVIK SSDLEKPDLI 
VGYVPVNDCA PFAIAWKKGF FRKYGLNVQL NREASWATSR DGLIFGRLDA APVVSGAVTN
ARIGAEGARH APLCAAMTIH RHGNAMTMNK AMWDFGLRPW YEYQEKYGDG ALEAFGRDFR
GYFEKQPPEN KVWAVVLSSA IYEYFVRYVS AAAGVDPLKE FRVIIVPPPQ MVTNVRIGAM
QAYMVAEPWN TRAITGNEGI GFTFAQGKEV WLGHPDRLLG VMESFIDQYP KTYRSLVKAM
IEACQYCSKP ENRQEVAELI TDRSFTGARP KNKNLPITKL TAPGIIGSYN YGGFDGKDRT
IPAADTTIFY DIPDNLPKQP AEHSTFLWRS RSLWLMTQAA RWGQIKEFPK NAEQLAEKGW
RTDLYRQIAA EMGIQCPQDD YKVEPPEVFI DKKGFDPSDP VGYLNSFAIR ANAPTRFFLS