Gene Francci3_1339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1339 
Symbol 
ID3906552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1607225 
End bp1608745 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content74% 
IMG OID637878672 
Productadenylylsulfate kinase 
Protein accessionYP_480445 
Protein GI86740045 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases 
TIGRFAM ID[TIGR00455] adenylylsulfate kinase (apsK) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.881911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGG CGCTGCACCT CCGAACGACC ATTCCCCTGC CGGACCAGCC GGGGGGGCCG 
GACGGACCCC GGGTCCGGCT AGTCGGCACG GTCGAGGGCA TCCGGCTGCC CCACCACCCC
GATCATCCCG GCCTGCGGCT CACCCCGGCC CAGGTCCGGG CCGAGCTGCT CGCCCGAGGC
TGGATCAGCC TCGAGGCGGG TTCCGGCGCG TGCTGGGCGG TGTGGGCGGA CGGCCTGCTC
CACACGGCGG ACGTGGGCCG GATCCGGGCG CTGACCCGGC AGGGCAAACG GTGTGTCGTC
CTCGCCCCGG TGGGTGGCGC CGATCCCGCG GACGCCGAGC ATCACCTGCG GGTCCGTTGC
CTGCTCGCCG CTCTCAAGGC GGTTGACGCC CCGTTGCGGG CCGCCGAGGC GACGCTCTCC
TCGGAGGCGC TCGCCGCCGA CGCGATCAGC CCGCCGGGTG TTGCCGGCCG GCGGTCGGAA
CCGCCGCCAG ACCCCTACCG GCGGCCGAAA GCCGGCGAGG CGGGAACGCC GGAGCACCGC
AGCCTGCTGG TCCTCGTTCC GGTCATCCCG TCGGAGGCGC TGGCCACCCC CCGCGAACCG
GGCATGAGCG CGACGGTCCT GAAAACATCC GCGAAACCGA GCGACGCGGC GGCAGGGGTA
CCGGGCCCGG CCACGGCCGT GACGCCCGCC CCCCCAGCGC CGACGGACCT GGCGGACATC
GCGTGGGCGG CGGCGGGCGC GGCCTGGACA GGCGTGGAGA CGGCGGCGAG CTCCGCCGAC
GAACTGGCGA AACTCGCCGC CCAGCGGGCG CATCTGGCCA GCTTCTACGG CCTGACCGGC
AGCCTCATCG GGCCGGCGAT CGGGGCACCC GGGCGGATGG AGCTGGCCTC GCTGCTCGAC
GCTGGCAACC CGATACCGGC CGAACTGACC CCGCCGACGG TGGCCGCGGA GCTCTCCCGC
GCCATCCCAC CACGTTCGAA GCGGGGCCTG ACCGTGTTTC TCACCGGGCT GTCCGGCTCC
GGGAAGTCGA CGCTGGCCGG CCTGCTCGTC TGTCGACTGC TGGAATACGG CGGACGCCGG
CTCACTTTGC TCGACGGGGA CGTCGTGCGG ACCCACCTGT CCCAGGGACT GGGATTCTCC
CGCGCCGACC GCGATATCAA CGTCCGACGG ATCGGCTTCG TCGCGGCCCA GGTCGCGGGA
GCCGGAGGCA CCGCGGTGTG CGCGCCGATC GCGCCGTACG CCGACGTGCG CGCCCAGGTC
CGGGGGATGG TCCGGGCCGC AGGCGGCGGA TTCGTGCTGG TACACGTGTC CACACCGCTG
GAGGTGTGCG AGGCGCGGGA CCGCAAGGGG CTCTATGCCA AGGCGCGCGC AGGCGTCCTG
CCCGCCTTCA CCGGCGTGTC CGATCCGTAC GAGACGCCGA CCGACGCCGA CGTCACGGTG
AACACCGCGG AGCTCTCCGC CGAGGACGCC GTGGACCGGA TCATCGACCA CCTACGCCAC
GCCGGCTGGC TGACTAGCTG A
 
Protein sequence
MIAALHLRTT IPLPDQPGGP DGPRVRLVGT VEGIRLPHHP DHPGLRLTPA QVRAELLARG 
WISLEAGSGA CWAVWADGLL HTADVGRIRA LTRQGKRCVV LAPVGGADPA DAEHHLRVRC
LLAALKAVDA PLRAAEATLS SEALAADAIS PPGVAGRRSE PPPDPYRRPK AGEAGTPEHR
SLLVLVPVIP SEALATPREP GMSATVLKTS AKPSDAAAGV PGPATAVTPA PPAPTDLADI
AWAAAGAAWT GVETAASSAD ELAKLAAQRA HLASFYGLTG SLIGPAIGAP GRMELASLLD
AGNPIPAELT PPTVAAELSR AIPPRSKRGL TVFLTGLSGS GKSTLAGLLV CRLLEYGGRR
LTLLDGDVVR THLSQGLGFS RADRDINVRR IGFVAAQVAG AGGTAVCAPI APYADVRAQV
RGMVRAAGGG FVLVHVSTPL EVCEARDRKG LYAKARAGVL PAFTGVSDPY ETPTDADVTV
NTAELSAEDA VDRIIDHLRH AGWLTS