Gene Francci3_3004 details


Gene Information

Locus tag: Francci3_3004
Symbol:
ID: 3905501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3562045
End bp: 3563139
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 72%
IMG OID: 637880324
Product: arginine/ornithine transport system ATPase
Protein accession: YP_482090
Protein GI: 86741690
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family
TIGRFAM ID: [TIGR00750] LAO/AO transport system ATPase
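
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3562045, 3563139)
print(length)                     # 1095
print(protein_length_aa(length))  # 364
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields above.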


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.487769
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.192606
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTATCCG GCATGGTGAC CCGACGACCG ATCGATCCGG GTGAGTACGC CGACGGGGTG 
CTGGCCTCGT CGCGGACCTG GATCGCGCGG GCCATCACCC TCGTCGAGTC CACCAGGCCC
GACCACCAGG AGCTCGCGCA GCAGCTGCTG GTGACGTTGC TGCCGCACGC GGGCAAGGCC
CGTCGGATCG GCATCACCGG GGTGCCCGGG GTGGGCAAGT CGACGTTCAT CGACGCGCTG
GGCACCATGC TCACCGGGAA GGGAAACCAG GTGGCGGTGC TCGCCGTCGA CCCCTCCTCG
ACGCGTACCG GCGGCAGCAT CCTGGGGGAC AAGACCCGGA TGGCGGCGCT GTCCACCGAT
CCGGCGGCGT TCATCCGGCC CTCGCCCACC GCGGGGACGC TCGGTGGCGT CGCCAAGGCG
ACCCGCGAGG CGATGGTCGT CATGGAGGCG GCGGGCTACG ACGTTGTCCT GGTCGAGACG
GTCGGCGTCG GCCAGTCGGA GACGACCGTC GCCGACATGG TCGACACCTT CCTGTTCCTG
ACCCTGGCCC GCACGGGGGA TCAGCTCCAG GGCATCAAGA AGGGGGTGCT GGAGCTCGCC
GACGTCATCG CGGTGAACAA GGCCGACGGG CCGCACGAGC TCGACGCCCG CCGTGCCGCC
CGGGAACTCG CCGGCGCGCT GCGGCTGCTG CAGGGCCCCG CCGGTCTCCG GGACTGGAAC
ACCCCGGTGC TGACGTGCAG CGCGGTGGAG CGGACGGGGC TGGACACTGT CTGGGCGCAG
ATCGTCAAGC ATCAGGACAC CCTCGACGCC TCCGGCGAAC TCGCCGCCCG CCGCCGCCGC
CAGCAGGTCG ACTGGATGTG GGCGATGGTC CACGATCGGC TGCTCGCGGG CCTGCGGGCC
GATCCGGGGG TCCGCGAGAT CACTCCGGAC CTTGAACGGC GGGTCCGCGA CGGCACCCTC
CCCCCGAGCC TGGCCGCCGA CGCCATCCTC GCCGCGCATG ATGGCCTGGC CGCCCACGAG
GGCCGGGCCG CGGCACCCTC CGACGCCGCG GAAAGCCGCG ACGGAAAGCC ACGACGGAAA
AGTGTGGAGG GTTAG
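
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of 10 bases across several lines. The helper name `gc_fraction` is hypothetical, and how the database rounds the reported percentage is not stated in the record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for block formatting."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small self-checks on toy inputs (not taken from the record).
print(gc_fraction("ATGC"))      # 0.5
print(gc_fraction("GG CC\n"))   # 1.0
```

Passing the full gene sequence above to `gc_fraction` and multiplying by 100 gives a percentage to compare against the GC content field.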
 
Protein sequence
MVSGMVTRRP IDPGEYADGV LASSRTWIAR AITLVESTRP DHQELAQQLL VTLLPHAGKA 
RRIGITGVPG VGKSTFIDAL GTMLTGKGNQ VAVLAVDPSS TRTGGSILGD KTRMAALSTD
PAAFIRPSPT AGTLGGVAKA TREAMVVMEA AGYDVVLVET VGVGQSETTV ADMVDTFLFL
TLARTGDQLQ GIKKGVLELA DVIAVNKADG PHELDARRAA RELAGALRLL QGPAGLRDWN
TPVLTCSAVE RTGLDTVWAQ IVKHQDTLDA SGELAARRRR QQVDWMWAMV HDRLLAGLRA
DPGVREITPD LERRVRDGTL PPSLAADAIL AAHDGLAAHE GRAAAPSDAA ESRDGKPRRK
SVEG
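
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table in the usual TCAG order; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons. Alternative start codons are not handled here, which is an acceptable simplification for this record because the CDS begins with ATG. The name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is stripped first. Alternative start codons (e.g. GTG, TTG)
    are NOT converted to M; this sketch only covers ATG-initiated sequences.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence listed above.
first = "ATGGTATCCG GCATGGTGAC CCGACGACCG ATCGATCCGG GTGAGTACGC CGACGGGGTG"
last = "AGTGTGGAGG GTTAG"
print(translate_cds(first))  # MVSGMVTRRPIDPGEYADGV
print(translate_cds(last))   # SVEG
```

Both fragments match the corresponding start and end of the protein sequence listed above. The same function can be applied to the full gene sequence for a complete comparison.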