Gene Francci3_2170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2170 
Symbol 
ID3906770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2542215 
End bp2543240 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content74% 
IMG OID637879503 
ProductLAO/AO transport system ATPase 
Protein accessionYP_481269 
Protein GI86740869 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family 
TIGRFAM ID[TIGR00750] LAO/AO transport system ATPase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.300615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.661007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCTCGT CGATCGAGTC GGTGACCGAA CTCGTCACCG CCGCCGTCGC GGGCAACCGC 
CGCGCGGTGG CGAGGCTCAT CTCCATGGTC GAGGACGGTT CACCGTACCT GCAGCGAACC
GAGCAGCTGC TGATGCCTCG CACCGGGGGC GCGCAGGTGA TCGGCCTCAC CGGGGCGCCG
GGGGTCGGCA AGTCCACCTC CACCTCCGCG CTGGTGGGCG CCTATCGGGC TGCGGGCCGG
CGGGTCGGCG TGCTGGCCGT CGACCCCTCG TCCCCGTTCA CCGGCGGCGC GCTGCTCGGC
GACCGGATCC GGATGGTCGA GCACGCCACC GATCCGGGCG TCTTCATCCG CTCGCTGGCG
ACGCGCGGAA GTCTCGGCGG CCTGTCGACC GCCACGCCCC AGGCCCTGCG GGTTCTCGAC
GCCGTCGGGT TCGACGTCGT CCTGATCGAG ACCGTCGGGG TGGGACAGGC CGAGGTGGAC
ATCGCCGCCC TGGCGGACAC CACGCTCGTC CTGGTCGCCC CAGGGATGGG GGACGGCGTC
CAGGCCGCCA AGGCGGGCGT TCTGGAGATC GCCGACGTGC TGGTGGTCAA CAAGGCCGAC
CGGCCGGGTG CGGACCGGAC GGTCAACGAG CTCACCGCCA TGCTCCGGAT GGGCGGTGCC
GGCCGCGGGG AGGACGACCG GCGTCCGCAG GTCGTGCGGA CGGTCGCCGC CACCGGGCAG
GGGGTGGCCG AGCTCGTCGA ATCGATCGAG GCCCATCGGG ACTGGCTGCT GCGCACGGGG
CAGCTGCAGC GCCGACGGCG TCGGCGCGCG GTCGACGAGA TCTCGCGGAT CGCCTTCGCC
CGGCTGCGGG AGCGGATCGG GGACCTGCGG CCCGGAGCCG CGATCGACCA GCTCGCCGAC
GAGGTCGTCG CCGGCGGACT CGACCCCCAC ACCGCGGCCG ATCGTCTCCT CGACGAGGCA
CTCGCCGCTC AGGCCGCCGT TGGCCTGGCC GCCGGTCACA CCGGTGCGGT GAAACCGACC
AGGTAA
 
Protein sequence
MGSSIESVTE LVTAAVAGNR RAVARLISMV EDGSPYLQRT EQLLMPRTGG AQVIGLTGAP 
GVGKSTSTSA LVGAYRAAGR RVGVLAVDPS SPFTGGALLG DRIRMVEHAT DPGVFIRSLA
TRGSLGGLST ATPQALRVLD AVGFDVVLIE TVGVGQAEVD IAALADTTLV LVAPGMGDGV
QAAKAGVLEI ADVLVVNKAD RPGADRTVNE LTAMLRMGGA GRGEDDRRPQ VVRTVAATGQ
GVAELVESIE AHRDWLLRTG QLQRRRRRRA VDEISRIAFA RLRERIGDLR PGAAIDQLAD
EVVAGGLDPH TAADRLLDEA LAAQAAVGLA AGHTGAVKPT R