Gene Avin_01780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_01780 
Symbol 
ID7759140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp166855 
End bp168561 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content67% 
IMG OID643803099 
ProductNa/Pi cotransporter II protein 
Protein accessionYP_002797415 
Protein GI226942342 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter
[TIGR01013] Phosphate:Na+ Symporter (PNaS) Family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACGC TGCTTCACCT GCTGTCAGCC ATCGCCCTGC TCGTGTGGGG CACACATATC 
GTTCGTACCG GCATCCTGCG CGTCTATGGC CTGCAACTGC GCCAGTTGCT CAGCCACAGC
ATGCGCCGGC CATCCCTGGC TTTCCTCAGC GGCATGGGCG TCACCGCACT GGTGCAAAGC
AGTAACGCCA CCGCCATGCT GGCCAGTGCC TTCGTCGCCG AGGGGCTGAT GGCGCTGACT
CCGGCACTGG CAGCCATGCT GGGTGCGGAT GTCGGCACGG CCGTGATGGC GCGGGTGCTG
ACCCTGGATC TATCCTGGCT GTCGCCGCTG CTGCTGCTGT GCGGCGTCAG CCTGTTCCTC
GCCCAGAAGA GGAACCGTGC CGGCCAGCTC GGCCGGGTGG CGATCGGCCT CGGCCTGATC
ATGCTCGCCC TCGAACTGAT CGTCGTGGCC AGCGAACCCA TCACCCATGC CCAAGGACTG
GGCCTGTTGT TCGCCTCGCT GACCGGCGAC CCGCTGCTGG CCGCGGTGAT CGGCGCTCTG
TTCGCCATGC TCACCTATTC CAGCCTGGCC ACGGTGCTGC TCACCGCCAC CCTGGCCGGT
GCCGGACCGA TCGACCTGCC GCAGGCCATC GGCCTGGTGA TCGGCGCCAA CATCGGCAGC
GGCATGCTGG CCTACCTCAA CAGCAGCCTG CATGCCGCTG CCGGCCGACG GGTCGCCCTC
GGCAACCTGC TGTACAAGCT GCTCGGACTG CTGGTGCTAC CGCTGCTCGA TCCGCTGACG
GCCTGGATGC GAACGCTGCC GCTCAGCCTG CAGGACCAGG TGATCGGTTT CCACCTGGCC
TACAACAGCC TGCGCTGCCT GCTGCTGCTA CCCAGCGTCG CGCCGATGGC GCACCTGTGC
ACCCGGCTGC TGCCGGAGCG GATGACGGAA AGAAACGGTA CGGCCCAGCC GCGCTATCTC
GATCCGGAGG CCCTGCCGAC GCCGACCCTG GCGCTGGCCA ATGCGGTACG CGAAACCCTG
CGTATCGGCG ATCTGGTCGA ACAGATGCTC GGTCATCTAC AGGATGTGCT GCTGGGACAC
CGGGCCGAGG CGGGCCGCGA GATCCGCCGC CTCGAAGACG AGCTGGACAG GCTCTACGGA
GCGGTGAAGC TGTACCTGGC CAAGTTGCCG CGCCAATCGC TGGGCGACGC GGAAGACCGC
CGCTGGGCGG AGATCATCGA ACTGGCGGTC AATCTGCGCC AGGCCGGCTA CATCCTCGCC
AAGATGCAGC ACAGGGCCGA GCGGCGGAGC GTCACCCGTC CCGATCAGGA AGAGCTGACG
GAGCTGCATG GCGAACTTCT GGCCAATCTG CGCCTGGGAC TGAGCGTGTT CCTTTCCGGC
GACTCGCGCA GCGCGCGCCA ACTGCTGCGC CAGAAACGCC GTTTCCGCGC CCTGGAACGC
CATCTGGCGC ATGCCCATGT CGATCGCCTG CATCGCCAGC CTCTGCACAG TGCCGAAGTC
GGCTCGGCTC ATCTGGAGTT GCTGGAAGAC ATGAAGCGCC TCAATTCGCT GTTCTGCTGC
AGCGCCTATG TGGTGCTGGA GGCCGAGGCC CAGAACACCG ACTACCCGGA CGAAAGGCCC
GGACACGGCC GACAGGACGA CGAACTGCGC CGTCTGTTGA TCGACGATGC GGCGAACAGA
CCGGCGGGAG GCTCGGCGGC GGGCTGA
 
Protein sequence
MLTLLHLLSA IALLVWGTHI VRTGILRVYG LQLRQLLSHS MRRPSLAFLS GMGVTALVQS 
SNATAMLASA FVAEGLMALT PALAAMLGAD VGTAVMARVL TLDLSWLSPL LLLCGVSLFL
AQKRNRAGQL GRVAIGLGLI MLALELIVVA SEPITHAQGL GLLFASLTGD PLLAAVIGAL
FAMLTYSSLA TVLLTATLAG AGPIDLPQAI GLVIGANIGS GMLAYLNSSL HAAAGRRVAL
GNLLYKLLGL LVLPLLDPLT AWMRTLPLSL QDQVIGFHLA YNSLRCLLLL PSVAPMAHLC
TRLLPERMTE RNGTAQPRYL DPEALPTPTL ALANAVRETL RIGDLVEQML GHLQDVLLGH
RAEAGREIRR LEDELDRLYG AVKLYLAKLP RQSLGDAEDR RWAEIIELAV NLRQAGYILA
KMQHRAERRS VTRPDQEELT ELHGELLANL RLGLSVFLSG DSRSARQLLR QKRRFRALER
HLAHAHVDRL HRQPLHSAEV GSAHLELLED MKRLNSLFCC SAYVVLEAEA QNTDYPDERP
GHGRQDDELR RLLIDDAANR PAGGSAAG