Gene Avin_12980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_12980 
SymbolcysN 
ID7760240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1261845 
End bp1263746 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content62% 
IMG OID643804200 
Productbifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein 
Protein accessionYP_002798499 
Protein GI226943426 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00455] adenylylsulfate kinase (apsK)
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCATC AATCCGAATT GATCAGCGAG GACATCCTCG CCTATCTGGC TCAGCACGAA 
CGCAAGGAAC TGCTGCGCTT TCTCACCTGT GGCAATGTCG ACGACGGCAA GAGCACCCTG
ATCGGTCGCC TGCTGCACGA TTCCAAGATG ATCTACGAGG ATCACCTGGA AGCAATCACC
CGCGATTCGA AGAAGGTCGG CACGACCGGC GATGAGGTCG ATCTGGCGCT GCTGGTCGAC
GGCCTGCAGG CCGAGCGCGA GCAGGGCATC ACCATCGACG TCGCCTATCG CTATTTCAGC
ACTGCCAAGC GCAAGTTCAT CATCGCCGAC ACCCCTGGCC ACGAGCAGTA CACCCGCAAC
ATGGCCACCG GCGCATCCAC CTGCGATTTG GCGATCATCC TCATCGACGC CCGCTACGGC
GTGCAGACCC AAACCCGCCG GCACAGTTTC ATCGCCTCGT TGCTGGGTAT CAGGCACATC
GTGGTCGCGG TCAACAAGAT GGATCTGATG GATTTCGATC AGGGTGTTTT CGAGCGCATC
CAGGCCGATT ACCTGCAGTT TGCCGAGCGC CTCGGCCTCA GGCCGAGTTC GCTGCATTTC
GTGCCGATGT CCGCGCTCAA GGGAGACAAC GTGGTCAACC GGAGCGAGCG TGCGCCCTGG
TACCAGGGGC CATCGCTGAT GGAGATCCTC GAGACCGTCG AAATCGCCGC CGATCGCAAT
CTGACCGACA TGCGTTTCCC GGTTCAGTAC GTCAACCGGC CGAACCTCAA CTTCCGTGGT
TTCGCCGGCA CCCTGGCCAG TGGTGTCGTG CACAAGGGGG ACGAGGTCGC CGTGTTGCCG
TCGGGCAAGA CCAGTCGAGT GCGTTCCATC GTCACCTATG ATGGTGAACT GGAGCAGGCG
ATTCCCGGTC AGGCGATCAC CTTGACTCTC GAGGACGAGA TCGACGTGTC GCGCGGCGAC
ATGCTGGTGC ATGCCGACAA TCGTCCACAG GTCGCCGACA GCTTCGAGGC CATGCTGGTA
TGGATGGCCG AGGAGCCGAT GCTGCCGGGC AAGAAGTACG ACATCAAGCG CGCCACCAGC
TATGTGCCGG GCAACATCGT CGCCATCGGG CACCGTATCG ACGTCAACAC CCTCGATCGT
GCACCGGCCA GCGAACTGAA GCTCAATGAG ATCGCTCGTG TTCGGGTCGG CCTGGATGCG
CCCATTGCCC TCGATGGTTA TGAGTACAAC CGCACCACCG GTGCCTTCAT CGTCATCGAT
CGGCTGACCA ACGGTACCGT TGGCGCCGGT ATGATCGTCG CCGAGCCTCC GTCCGGGCAG
AACGTCGGCG GTCATCATGG GCTGCTAGCC CATGTCAGCG CCGAAGAGCG TGCCGCACGC
TTCGGCCAGC GGCCGGCCAC GATTCTGTTC ACTGGCCTGT CAGGCGCCGG CAAGAGCACC
CTGGCCTATG CCCTGGAGCG CAAACTGTTC GATATGGGGC GCGCAGTATA CGTACTGGAT
GGGCAGAATC TGCGTCATGA TCTGAACAAG GGATTGCCAT TGGATCGTGC CGGGCGCGCC
GAGAACTGGC GGCGCGCCGC CCAGGTGGCT CGGCAGTTCA ACGAAGCCGG TCTACTGACG
CTGGCCGCCT TCGTCGCTCC GGATGCCGAG GGGCGTGCCC AGGCGCGGGC TCTGATCGGT
GTCGAGCGCC TGATCACCGT CTATGTACAG GCTTCCCCGC TGGTCTGCCG TGAGCGGGAT
CCACAGGGCC TCTATGCTGC GGGAGGCGAT CATATCCCCG GTGAGTCCTT CCCCTACGAC
ATACCGCTGG ATGCGGATCT GGTGGTCGAT ACCTTGCATC TTTCGGTGGA AGAGGGGGTG
AAGCAGGTGC TCGAGCTGTT GCGTAGTCGC GGCGCCCTTT AG
 
Protein sequence
MSHQSELISE DILAYLAQHE RKELLRFLTC GNVDDGKSTL IGRLLHDSKM IYEDHLEAIT 
RDSKKVGTTG DEVDLALLVD GLQAEREQGI TIDVAYRYFS TAKRKFIIAD TPGHEQYTRN
MATGASTCDL AIILIDARYG VQTQTRRHSF IASLLGIRHI VVAVNKMDLM DFDQGVFERI
QADYLQFAER LGLRPSSLHF VPMSALKGDN VVNRSERAPW YQGPSLMEIL ETVEIAADRN
LTDMRFPVQY VNRPNLNFRG FAGTLASGVV HKGDEVAVLP SGKTSRVRSI VTYDGELEQA
IPGQAITLTL EDEIDVSRGD MLVHADNRPQ VADSFEAMLV WMAEEPMLPG KKYDIKRATS
YVPGNIVAIG HRIDVNTLDR APASELKLNE IARVRVGLDA PIALDGYEYN RTTGAFIVID
RLTNGTVGAG MIVAEPPSGQ NVGGHHGLLA HVSAEERAAR FGQRPATILF TGLSGAGKST
LAYALERKLF DMGRAVYVLD GQNLRHDLNK GLPLDRAGRA ENWRRAAQVA RQFNEAGLLT
LAAFVAPDAE GRAQARALIG VERLITVYVQ ASPLVCRERD PQGLYAAGGD HIPGESFPYD
IPLDADLVVD TLHLSVEEGV KQVLELLRSR GAL