Gene Francci3_1342 details

Gene Information

Locus tag: Francci3_1342
Symbol:
ID: 3906555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1611061
End bp: 1612305
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 68%
IMG OID: 637878675
Product: sulfate adenylyltransferase subunit 1
Protein accession: YP_480448
Protein GI: 86740048
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR02034] sulfate adenylyltransferase, large subunit
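
The start, end, gene length, and protein length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1611061, 1612305)
print(length)                     # 1245
print(protein_length_aa(length))  # 414
```

Both results match the Gene Length (1245 bp) and Protein Length (414 aa) fields listed above.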


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0489751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAGT TGCTGCGGGT CGCGACGGCG GGGTCCGTCG ACGACGGGAA GTCGACCCTG 
ATCGGGCGGC TCCTCTACGA CACGAAGTCG CTGTTCTCCG ACCAGCTCGC CGCGGTCGAG
CGGACCAGTC GGGAGCGCGG CGACGGCTAC GTCGACCTCG CGCTGCTCAC CGATGGTCTG
CGGGCCGAGC GGGAGCAGGG CATCACGATC GACGTCGCGT ACCGTTACTT CGCCACGCCC
CGGCGGTCGT TCATCCTCGC TGACACCCCC GGGCACGTGC AGTACACCCG CAACATGGTC
ACCGGCGCGT CCACCGCCGA CCTCGCGCTG CTGCTCGTCG ACGCGCGCAA GGGGGTCGTC
GAGCAGACCC GCCGGCACGC CTTCCTGACG TCCTTGCTGC GCGTGCCCCA TCTGGTGCTC
TGTGTGAACA AGATGGACCT CGTCGACTTC GATCGGGACG TCTTCGAGAA GATCAAGGAG
GAGTTCCGGC ACTTCGCGGC GAAACTGGAG ATCGTCGACG TCACGACGAT CCCGATCTCC
GCGCTGGGCG GGGACAACGT GGTCTCCCGG TCGCCGAGCA CTCCCTGGTA CGGGGGCAGC
TCGCTGCTGC ACCACCTGGA GGAGCTCCAC ATCGCCTCCG ACCGCAACCT CATCGATGTG
CGCTTCCCGG TGCAGTGGGT GGTCCGGCCG CGGAACGACG ACCTGCACGA CTACCGCGGG
TACGCCGGTC AGGTCGCCGG CGGGGTGCTC AAGCCCGGCG ACGAGGTCGT CGTCCTGCCC
TCCGGGCTGA CCACGCGGAT CGTCGGCATC GACACGTTCG ACGGCCCGGT TGACGAGGCC
TTCCCGCCGA TGTCGGTGAC CATCCGGCTC GAGGACGATC TGGACGTCTC CCGGGGCGAC
ATGATCGCCC GGCCGATGAA CCAGCCGACC GTCGGCCAGG AACTCAACCT GATGGTCTCC
TGGATGGTCG ACGCCCCGCT GCGCCGCCGG GCCCGGATCG GGATCAAGCA CACCACCCGT
TCGGTACGGG CGATGGTCAC CGACATCTGC TACCGCGTCG ACATCGACAC CCTGCACCGC
GACGAGACCG TCGAGACGCT GGGTCTGAAC GACATCGGCC GGATCGCCCT GCGGACGACG
TCCCCGCTGT GCTACGACCT GTACCGCCGC AACCGGAACA CCGGCAGCGT GCTGCTCATC
GACGAGTCGA CCGGGACCAC TCTCGGCGCC GGGATGATCA TCTAG
 
Protein sequence
MAELLRVATA GSVDDGKSTL IGRLLYDTKS LFSDQLAAVE RTSRERGDGY VDLALLTDGL 
RAEREQGITI DVAYRYFATP RRSFILADTP GHVQYTRNMV TGASTADLAL LLVDARKGVV
EQTRRHAFLT SLLRVPHLVL CVNKMDLVDF DRDVFEKIKE EFRHFAAKLE IVDVTTIPIS
ALGGDNVVSR SPSTPWYGGS SLLHHLEELH IASDRNLIDV RFPVQWVVRP RNDDLHDYRG
YAGQVAGGVL KPGDEVVVLP SGLTTRIVGI DTFDGPVDEA FPPMSVTIRL EDDLDVSRGD
MIARPMNQPT VGQELNLMVS WMVDAPLRRR ARIGIKHTTR SVRAMVTDIC YRVDIDTLHR
DETVETLGLN DIGRIALRTT SPLCYDLYRR NRNTGSVLLI DESTGTTLGA GMII
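
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC fraction. The following is a minimal sketch, not the method used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it strips the spaces that group the sequence into blocks of ten. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
sample = "ATGGCCGAGT TGCTGCGGGT CGCGACGGCG"
print(translate(sample))  # MAELLRVATA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.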